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IDENTIFICATION, PURIFICATION AND USES OF LECZYMES 

AND 

CARBOHYDRATE LIGANDS 

BACKQROTTWn ffF TffE INVRMTJ Ofl 

5 FTET.n HP THE TWKMTJft N 

This invention relates generally to the field of 
glycobiology. 

The major histocompatibility complex (MHO codes for 
10 a variety of gene products, many of which play a central 
role in the body's defense against pathogenic organisms. 
Such molecules include the classical transplantation 
antigens and structurally related molecules, proteins for 
transport of foreign peptides within cells, serum 
15 complement proteins, the lymphokines tumor necrosis a and 
tumor necrosis S, cytochromes and heat shock proteins. 

The classical transplantation antigens, encoded for by 
genes in the MHC, are a highly polymorphic group of 
molecules that were originally discovered for their role in 
20 determining rejection of foreign transplanted cells and 
tissue. An extensive body of experimental work has since 
supported a role for the classical transplantation antigens 
in self -recognition. In the currently accepted paradigm, 
hereinafter the "old paradigm" transplantation antigens 
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s rve to present peptides derived from both self and 
foreign proteins, for recognition by cells of the immune 
system. 



Two distinct groups of antigens, class I and class II 
5 antigens, are encoded by genes within the MHC. Class I 
antigens are expressed on virtually all nucleated cells in 
the body and play a role in the mediation of immune 
responses based on cytotoxic thymus -derived (T) lymphocyte 
mediated cell killing. Cytotoxic T lymphocytes play a role 

10 in killing of virus infected cells and tumor cells. The 
class I MHC molecule is composed of an approximately 45 
kiloDalton (kDa) heavy chain associated non-covalently with 
a 12 kDa protein known as microglobulin (SjM) . The old 
paradigm characterizes class I antigens as presenting 

15 peptide fragments derived from both self and foreign 
proteins synthesized endogenously within the cell. 

The class I molecules were discovered for their role 
in transplantation and were termed the "classical" class I 
molecules, to distinguish them from a later discovered 

20 group of class I molecules termed the "nonclassical " class 
I molecules. Genes encoding the nonclassical class I MHC 
molecules consist of the majority of genes so far 
identified in the MHC locus. Nonclassical class I MHC 
molecules are overall structurally related to the classical 

25 class I MHC transplantation antigens in having extensive 
sequence homology and a heavy chain noncovalently 
associated with SjM. Nonclassical class I MHC molecules 
are, in general, less polymorphic than the classical class 
I MHC molecules and are more circumscribed in their tissue 

30 distribution. Several types of nonclassical class I 
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molecules are expressed principally in the gastrointestinal 
(GI) tract, raising questions regarding their function, if 
any in the immune system. 



MHC class II antigens are expressed principally by 
5 specialized antigen presenting ceils in the body. Such 
cells are limited to the antibody producing B lymphocyte as 
well as macrophages and dendritic cells distributed in 
various tissues of the body. The class II molecule on the 
cell-surface is composed of an a chain of 33 kDa and a fi 
10 chain of 28 kDa associated noncovalently . Class II 
molecules as understood under the old paradigm function 
principally to present peptides derived from self or 
foreign proteins to a specialized class of T lymphocyte 
that supports the development of cytotoxic T lymphocytes, 
15 provides immunity to fungal infections and assists B 
lymphocytes in the generation of protective antibody 
responses to encapsulated bacterial infections. MHC class 
II antigens present peptide fragments derived from proteins 
taken up by cells from the surrounding environment, in 
20 contrast to classical class I molecules, which present 
peptides derived from endogenously synthesized proteins. 

A variety of human autoimmune diseases have been shown 
to be associated more frequently in the population with 
individuals who inherit certain genes of the MHC. For many 
25 of these diseases, the association is localized to the 
region of the MHC encoding class II histocompatibility 
antigens. These diseases are not inherited by simple 
Mendelian segregation of MHC genes, since only one sibling 
of a set of identical twins may have the disease. This 
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feature suggests that other genetic factors or 
environmental factors have roles in the development of 
autoimmunity, with genes in the MHC playing a significant 
part of the process. 

5 The old paradigm for MHC gene function provides 

several theories to explain a role for MHC genes in 
autoimmune disease. They include the inappropriate 
expression of class II MHC molecules in cells eliciting the 
autoimmune response or aberrant recognition of 

10 self -peptides by particular MHC gene products. Such 
theories, however, remain to be proven. In addition, the 
old paradigm fails to provide a useful hypothesis to 
explain the basis for an MHC- associated iron storage 
disease known as hemochromatosis. This disease is known 

15 from animal studies and from the genomic structure of 
several class I genes to involve an MHC encoded class I 
molecule since deletion of the l^M gene in these animals 
results in the disease. 

Disclosed herein is a new paradigm relating to 
20 carbohydrate-recognition molecules including at least some 
of class I, class II antigens and other related molecules, 
some of which are of the IgGSF. This paradigm provides 
that carbohydrate-recognition molecules have a central 
function to recognize and modify carbohydrate structures. 
25 Thus, there exists a need to develop new approaches to the 
treatment of MHC-associated and other diseases involving 
problems with carbohydrate -recognition, and in particular, 
a need to provide 
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composition, apparatus and methods for utilizing the new 
paradigm . 

SUMMARY OP TTTF nnnnrrTftrc 

5 In various aspects of the present invention, 

carbohydrate ligands and peptide mimetopes of such 
ligands are provided which specifically bind to leczymes, 
and leczymes are provided which specifically bind to 
carbohydrate ligands and peptide mimetopes. Other 

10 aspects of the invention include methods for identifying, 
purifying and producing such carbohydrates, mimetopes and 
leczymes, and the use of such compositions in treating 
various diseases and otherwise modifying the functions of 
cells. Still other aspects of the invention involve 

15 diagnosis of various diseases involving -carbohydrate 
ligands, peptide mimetopes of such ligands, and/or 
leczymes. Also provides are methods for modifying a 
cell to produce a carbohydrate ligand by introducing an 
expression vector encoding a leczyme into the cell, 

20 wherein the expression of the leczyme produces the 
carbohydrate ligand. 

In an exemplary embodiment, an immune response can 
be modulated by administering a carbohydrate ligand or a 
peptide mimetope to a patient. In another exemplary 
25 embodiment, hemochromatosis can be diagnosed by 
detecting a mutation in a class I MHC molecule that 
reduces its ability to associate with fi 2 microglobulin. 
In still another exemplary embodiment, the uptake of iron 
or other metal in a patient with abnormal absorption for 
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that metal can be modulated by administration of an 
appropriate carbohydrate. 



DETAILED DESCRIPTION OF TOE T KVEHTTON 



5 The present invention results from a profound new 

paradigm for the function of self -recognition molecules 
and other molecules in multicellular organisms, including 
mammals, which have a distinct population of phagocytic 
cells. The new paradigm holds that many types of 
10 self -recognition molecules heretofore known as peptide 
recognition and presentation structures have a more 
central function in the recognition and modification of 
carbohydrate -containing molecules. As used herein, 
"carbohydrate-containing molecules" means those molecules 
15 having at least one monosaccharide or sugar residue. 

Thus, self -recognition molecules and other molecules of 
the new paradigm have the ability to specifically bind a 
carbohydrate -containing molecule, and chemically modify 
it either by catalyzing further addition of carbohydrate 
20 or by catalyzing chemical modification of one or more of 
the existing carbohydrate residues. Therefore, the 
carbohydrate containing molecule is a substrate for the 
self -recognition and other molecules of the new paradigm. 
Additionally, after enzymatic modification, the 
25 self -recognition and other molecules can specifically 
bind with equivalent or greater affinity to the modified 
carbohydrate containing molecule than to the molecule 
originally recognized. 
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As used herein, "leczyme" means any cellular 
protein which can catalyze the chemical modification of a 
carbohydrate-containing substrate, resulting in a product 
with additional carbohydrate or chemically modified 
5 carbohydrate, and which then is capable of specifically 
recognizing the modified substrate. The term "leczyme" 
derives from the combination of having both 
lectin-binding and enzymatic activity in the same base 
molecule. Leczyme function is characteristic of many 
10 peptide recognition molecules that are well known in the 
art. Such molecules include at least some of class I and 
class II MHC encoded molecules, along with other members 
of the immunoglobulin gene superfamily (IgGSF) of 
molecules, some of which are not self -recognition 
15 molecules. Exemplary leczymes include those which 
catalyze phosphorylation, acetylation, carboxylation 
and/or sulfation of a carbohydrate- containing molecule. 
Leczymes can function within a cell, on the cell -surface, 
or can be secreted from a cell. 

20 

A leczyme can exhibit enzymatic activity and 
carbohydrate binding activity in the same isoform of the 
molecule or these activities can reside separately in 
different isoforms of the molecule. For example, 

25 differential RNA splicing of a leczyme can result in an 
enzyraatically active isoform of the leczyme which 
contains a signal (s) directing the leczyme to sites in 
the cell normally associated with glycosylation, such as 
the endoplasmic reticulum or the golgi complex. 

JO Differential RNA splicing can also result in an isoform 
of the leczyme that exhibits carbohydrate recognition 
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capability and contains a signal (s) directing the 
receptor to the cell-surface or to export from the cell. 
Alternatively, a leczyme expressed either in the cell or 
on the cell -surface can contain both enzymatic activity 
5 as well as carbohydrate recognition capability in the 
same molecule. 



In class I or class II molecules, and possibly in 
other leczymes, leczyme function can be resident in the 

10 groove formed at the top of MHC encoded molecule. The 
old paradigm characterizes the groove as a 
peptide -binding groove, but the new paradigm provides 
that the groove functions principally to recognize a 
carbohydrate structure and its subsequent form after 

15 modification by the leczyme. 

Compositions of substantially purified carbohydrate 
ligands can be isolated which specifically bind to a 
leczyme. As used herein, the term "carbohydrate ligand" 
means a carbohydrate -containing molecule where the 

20 carbohydrate portion is a part of the ligand that is 
recognized by the leczyme. A carbohydrate ligand is 
composed of one or more sugar residues that are linked in 
either straight chain or branched chain configuration. 
Therefore, a carbohydrate ligand includes a 

25 monosaccharide, disaccharide, trisaccharide, 
oligosaccharide or polysaccharide. 



Carbohydrate ligands composed of multiple 
carbohydrate residues can vary in the type and location 
of the linkage between each residue. Carbohydrate 
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residues useful for producing a carbohydrate ligand 
include, for example, glucose, galactose, fucose, mannose 
and sialic acid. Carbohydrate residues of a ligand also 
can be acetylated, phosphorylated and/or sulfated by 
5 chemical processes well known in the art. A carbohydrate 
ligand also can be chemically bonded to other molecules 
such as a lipid, glycolipid, protein, glycoprotein, 
proteoglycan, glucosaminoglycan or an organic molecule. 
Such additional molecules can provide the carbohydrate 
10 ligand with features such as increased binding to the 
leczyme or increased stability in vivo. 



A carbohydrate ligand can be multivalent in nature 
by having more than one carbohydrate ligand attached to a 
backbone structure. The backbone structure can be a 

15 natural protein such as a serum albumin -or can be a 
synthetic molecule such as a synthetic peptide. 
Approaches to link multiple carbohydrate ligands to a 
backbone structure are known in the art and include, for 
example, biotin-avidin linkage (Rothenberg et al., Proc. 

20 Natl, ftyrafl, Sea ■ flTfift) 90:11939-11943 (1993), which is 
incorporated herein by reference) . 

Application of the new paradigm yields new methods 
to treat disease states resulting from leczymes. Such 
disease states may include, for example, autoimmunity, 
25 hemochromatosis, inflammation, transplantation rejection, 
and infections. In many of the above disease states, 
disease may result in part from aberrant recognition of 
carbohydrate structures by lymphocytes. Thus, the 
administration of a carbohydrate ligand that can bind to 



WO 96/17870 PCT/DS9S/15913 

10 

the aberrant recognition molecule of an individual 
provides a means to disrupt the aberrant recognition 
mediating the disease. 

A variety of leczymes exist that differ in their 
5 ability to modify particular types of molecules. This 
difference results in part from differences in the 
specificity of the lectin binding site that leczymes have 
for their substrate. Thus, a part of the leczyme 
structure is a recognition site for the substrate. The 

10 catalytic site of a leczyme can be the same site as the 
substrate recognition site or can be a site different 
from the substrate recognition site. After modification 
of the substrate, the leczyme can exhibit similar or 
greater binding affinity for the modified substrate over 

15 the original substrate due to coordinate binding by both 
the substrate recognition site and the catalytic site of 
the leczyme or by multivalency of the ligand. 

Leczymes provided herein include a broad group of 
structurally related molecules, many of which are 

20 contained within the IgGSF. The IgGSF series of genes 
share an evolutionary homology (i.e., common ancestor) 
but are not necessarily functionally related, genetically 
linked or coordinately regulated. The products of the 
IgGSF have been defined by the presence of one or more 

25 regions homologous to the basic structural unit of 
immunoglobulin (Ig) , known as the Ig homology unit. 
These units are characterized by a primary amino acid 
sequence of about 70-100 residues in length and include 
an essentially invariant disulfide bridge spanning 50-70 
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residues in length and several other relatively conserved 
residues that maintain a tertiary structure known as the 
Ig fold (for review see Hunkapiller and Hood, Adv. 
Immunol 44:1-63, (1989)). 

5 The genes of the IgGSF encode many molecules with 

known immunological function, such as the 
immunoglobulins, T lymphocyte receptors, classical and 
nonclassical MHC molecules, various T lymphocyte and B 
lymphocyte cell -surface molecules or BjM. In addition, 

0 the IgGSF encodes several cell -surface molecules known to 
function as receptors for cell -cell adhesion. Such 
adhesion molecules include, for example, the neural cell 
adhesion molecule carcinoembryonic antigen. Of course, 
those IgGSF molecules which either do not modify the 

5 carbohydrate portion of the carbohydrate ligand, or which 
do not recognize the modified product are not leczymes. 

Leczymes of the IgGSF include those encoded by genes 
located within the MHC region. In humans, the MHC is in 
a continuous stretch of DNA located on the short arm of 
) chromosome 6. The entire MHC in humans is called the HLA 
complex. In mice, the MHC is located on chromosome 17 
and contains the H-2, Q, T and M complexes. As used 
herein, the term "MHC-derived gene product" means any 
molecule that contains at least one polypeptide encoded 
- for by a gene located within the MHC. Leczymes that are 
MHC-derived gene products include class I and class II 
molecules. Class I and class II molecules that are 
leczymes in humans are encoded by genes within the HLA-D 
region such as HLA-DP, HLA-DN, HLA-DM, HLA- DO, HLA-DQ or 
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HLA-DR, or the various alleles of HLA-A, HLA-B and HLA-C 
loci, or the HLA-X, HLA-E, HLA-J, HLA-H, HLA-G and HLA-F 
genes . 

Leczymes that are class I MHC molecules contain an 
5 approximately 45 kDa polymorphic heavy chain or a chain 
associated noncovalently with a small nonpolymorphic 
protein called SaM. The heavy chain is an MHC-encoded 
gene product located in or near the A, B or C regions of 
the human HLA complex and within or near the K or D/L 

10 regions of the mouse H-2 complex. Although l^M is encoded 
by a gene located outside the MHC and on a different 
chromosome, the heavy chain of the class I molecule is 
encoded by a gene located within the MHC, thereby 
including a class I molecule within the definition of an 

15 MHC-encoded gene product. 

Leczymes that are class II MHC molecules include 
those MHC-derived gene products composed of a 34 kDa a 
chain associated noncovalently with a 28 kDa S chain. An 
additional chain called the invariant chain is 
20 transiently associated with the class II heterodimer 
during transport to the plasma membrane of the cell. 

Leczymes can be expressed on the cell -surface by 
virtue of having a transmembrane region and cytoplasmic 
tail,, as in the case of the classical transplantation 
25 antigens. Leczymes also can be linked to the 

cell-surface in a manner similar to some nonclassical 
class I molecules. For example, many of the nonclassical 
class I Qa and Tla molecules are linked to the 
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cell-surface by a phosphatidylinositol (PI) linkage, and 
the product of the Q10 gene appears to be secreted 
(Devlin et al., EMBO J. 4:369-374 (1985)). The majority 
of Qa and Tla antigens lack the classical class I 
5 cytoplasmic exons including the phosphorylation site in 
exon seven (Thor et al., J. Immunol . 151:211-224 
(1993)), although the transmembrane domain and the 
seventh exon is present in Ql and Q2 gene products. 

The MHC class I heavy chain is organized into three 
10 external domains (al,a2 and a3) , each containing about 90 
amino acids each, a transmembrane domain of about 40 
amino acids and a cytoplasmic anchor segment of about 3 0 
amino acids. l^M is similar in size and in organization 
to the external a3 domain of the heavy chain. X-ray 
15 crystallographic analysis of the extracellular portion of 
the MHC class I molecule shows that the al and cr2 domains 
interact and are most external to the cell membrane while 
the 0(3 and fijM domains interact and are more proximal to 
the cell membrane. The interacting al and a2 domains 
20 form a platform containing a deep groove or cleft located 
on the top surface of the molecule. 

The old paradigm for the function of the classical 
class I MHC molecule interprets the groove at the top of 

25 the molecule as a peptide binding site. The site is 

sufficiently large enough to bind a peptide of about 8-20 
residues in length and present both self and 
foreign-derived peptides for recognition by certain T 
lymphoid cells. Extensive research has shown that the 

3 0 MHC classical class I molecule can bind peptide of about 
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the length of the groove. In addition, the x-ray 
crystallographic analysis of a classical class I molecule 
purified from a cell indicated that a peptide was 
resident in the groove. However, as described above, the 
new paradigm in the present invention provides that the 
peptide binding groove of the classical class I molecule 
MHC is also suited for binding a carbohydrate ligand. 

Leczymes that are a class II MHC molecule include 
those that share significant structural features with a 
class I molecule. The class II molecule is a membrane 
bound glycoprotein that contains external domains, a 
transmembrane segment, and a cytoplasmic anchor segment. 
The a chain contains two external domains labeled al and 
a2 and the £ chain contains two external domains Si and 
62 domain. X-ray crystallography shows that the a2 and 
22 domains interact as a membrane proximal structure 
analogous to the a3 domain and l^M domain interaction of 
the class I molecule. The or2 and £2 domains of a class 
II molecule that together form a cleft at the top of the 
molecule that is very similar to the cleft formed by the 
a2 and a3 domains of a class I heavy chain. Extensive 
evidence indicates that the groove in the class II 
molecule can bind and present both self and foreign 
peptides for recognition by T lymphoid cells. Peptides 
have been isolated from the class II molecule that are 
from 13-18 amino acids in length, slightly longer that 
the octomeric or nonomeric peptides commonly isolated 
from MHC classical class I molecules. As discussed 
above, the new paradigm of the present invention provides 
that the peptide binding groove in the class II MHC 
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molecule, like the groove in the classical class I MHC 
molecule, is also suited for binding a carbohydrate 
ligand. 

Leczymes may also be encoded by nonclassical class I 
5 genes. In the mouse, genes encoding leczymes are located 
in the MHC regions Q, T and M downstream of the classical 
histocompatibility antigens. There are similar regions 
in humans coding for known nonclassical class I molecules 
such as HLA-F and HLA-G. The nonclassical class I genes 
10 are overall less polymorphic than the classical class I 
genes and show different patterns of expression. The Q, 
T and M complex genes of mice consist of approximately 45 
genes, coding for non-polymorphic differentiation 
antigens with limited tissue distribution. 



15 Known leczymes which are nonclassical class I MHC 

molecules exhibit limited tissue distribution in 
comparison with leczymes that are classical class I MHC 
molecules. For example, the Qa and Tla antigens, the 
products of the Q and T genes, are expressed on 

20 subpopulations of lymphocytes (for review, see Flaherty 
et al. Critical Reviews in Immunology. 10:131-175 
(1990)). Previously, no convincing function had been 
assigned to the products of the nonclassical class I 
genes, although they have been suggested as possible 

25 restriction elements for yb T cells (Hershberg et al. 

Proc. Nat. Acad. Sci fUSA) . 87:9727-31 (1993)). The Qa 
and TLa antigens have also been reported to be expressed 
on intestinal epithelium (Wu et al, J . Exp . Med . , 
174:213-218 (1991); Hershberg et al . , Proc. Natl. Acad. 
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Sci, (Wflft) 87:9727-97231 (1990); Wang et al . , 
IrnmuncXTfinpr , , 38:370-372 (1993)) where their function was 
unknown. The new paradigm of the present invention 
provides that at least some of these nonclassical class I 
5 molecules are leczymes. 

The nonclassical class I molecule Q2, produced by a 
gene within the mouse MHC, is an example of a leczyme 
that is involved in iron transport (see Example I) . In 
this instance a mucin or other carbohydrate containing 
10 molecule acts as an iron chelator, and in turn also 
contains the ligand for leczymes expressed from the Q2 
gene. The gene for Q2 is located in a head to head 
relationship with another gene most likely encoding a 
mucin. Both genes share a single promoter region, 
15 located between the genes, the promoter being analogous 
in structure to the S-globin promoter involved in iron 
metabolism. The coordinated regulation of these two genes 
can be readily understood in view of the receptor/ligand 
and receptor/substrate interactions defined as leczyme 
20 function in the new paradigm. Interestingly, the Q2 gene 
is distinguished from other nonclassical class I genes in 
being highly polymorphic with Q2 molecules of different 
strains of mice differing significantly in amino acid 
sequence. Despite these differences, the Q2 molecules 
25 from separate strains of mice all function as a receptor 
for their co-regulated gene product since, as a leczyme, 
Q2 can enzymatically modify its ligand/substrate in 
accordance with the lectin recognition and enzymatic 
function of each Q2 gene product and can recognize the 
30 resulting product. Thus, the combined 
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enzymatic/recognition capability of a leczyme as defined 
in the new paradigm maintains receptor/ligand 
relationships in the face of extensive genetic 
polymorphism. 

5 Leczymes exist with a variety of enzymatic 

activities. For example, a leczyme can have a glycosyl 
transferase enzymatic activity that results in the 
catalytic transfer of a glycosyl group {mono or 
carbohydrate) from a glycosylnucleotide to an acceptor 
10 molecule such as a protein, carbohydrate or lipid. 
However, not all glycosyl transferases are leczymes. 

There is currently only one glycosyl transferase 
(Sl,4-galactosyltransferase) that is previously known to 
be expressed in both the cytoplasm and- on the cell (for 
15 a review see Shur, Curr. Opi n. in Cell Bin! s , fl^-fl^ 
(1993)). This enzyme has both carbohydrate recognition 
capability and carbohydrate catalytic activity and has 
been implicated in a variety of cell -cell and cell -matrix 
interactions. One hallmark of the cell-surface expressed 
20 form of fil, 4 -galactosyl transferase is that it no longer 
retains binding activity for the product it generates 
after enzymatic modification (Miller et al., Nature , 
357:590-593 (1992)). Thus, this particular transferase 
is not a leczyme because it fails to exhibit recognition 
25 for its enzymatic product. 

A leczyme of the IgGSF can be encoded by a gene 
located outside the MHC. For example, CD-I is a product 
of the IgGSF gene that is related in structure to the 
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class I MHC molecule, but the CD-I heavy chain is encoded 
by a gene outside the MHC. The T-6 CD-I molecule is 
expressed by a specialized antigen presenting cell in the 
skin (Langerhan's cell) and can be internalized into the 
5 cell along with MHC class II antigen, indicating an 
immunological function for T-6. 



A composition can be prepared which comprises a 
substantially purified carbohydrate ligand that is 
specifically bound by a leczyme. As used herein, the 

10 term "substantially purified" means a carbohydrate ligand 
that is relatively free from other contaminating 
molecules such as lipids, proteins, nucleic acids, 
carbohydrates or other molecules which are not normally 
associated with the carbohydrate ligand. Also as used 

15 herein, the term "specif ically bound" means having 

sufficient complementarity to achieve a concentration of 
unbound substrate in the presence of sufficient leczyme 
of at least 10" 1 Molar, and preferably 10* 6 Molar or 
greater. A substantially purified carbohydrate ligand 

20 can be obtained, for example, using well known 

biochemical methods of purification of a carbohydrate 
source or by chemical or enzymatic synthesis. 



A carbohydrate ligand of the present invention can 
include known carbohydrate containing molecules such as 
25 glycoproteins, proteoglycans, glycolipids or 

mucopolysaccharides that have N- linked or O-linked forms 
of glycosylation. Proteoglycans include, for example, 
mucins and those proteoglycans glycosylated with 
hyaluronate, chondroitin sulfate, heparin, heparin 
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sulfate or dermatin sulfate. Glycolipids include, for 
example, acylglycerols, sphingoids and ceraraides. 



A sample containing a substantially purified 
carbohydrate ligand can be obtained from a variety of 
5 sources such as from fluids, tissues or cells. These 
sources can be from any plant species or any animal such 
as a mammal or any organism. A source of carbohydrate 
ligand can also include a cell that has been modified by 
introducing into the cell an expression vector that 
10 encodes a leczyme or a protein that when expressed 
contains a carbohydrate ligand. 

A sample containing a carbohydrate ligand can be 
obtained from a chemically produced library of 

15 carbohydrates. Such libraries can be made by mixing 
carbohydrates from natural sources and from 
enzymatically-produced sources. In addition, individual 
carbohydrates from the library can be tagged with a 
detectable label such as a fluorescent label to assist in 

20 structural determination of the carbohydrate ligand. 

A sample containing a carbohydrate ligand can be 
processed to further purify the ligand by methods well 
known in the art. Such methods include, for example, 
purification of glycoconjugates, labeling of 
25 glycoconjugates by chemical or metabolic means, release 
of carbohydrates from glycoconjugates and 
characterization of the structure of the released 
carbohydrate (see, for example, Ausabal et al, In Current- 
ProtOCOTfi in Mol fibular RinlogY Vol. 2, chapter 17, (Green 
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Publishing Associates and Wiley Interscience, New York, 

1994); Fukuda and Kobata, Gl vcohi nl o gv : A nrarUfvil 

Approach , (IRL Press, N w York, 1993), both of which are 
incorporated herein by reference) . In addition, these 
5 methods are useful for structural characterization, 
including sequencing of the carbohydrate ligand. 
Elucidation of the structure of a carbohydrate ligand 
purified from a tissue or a cell can enable future 
production of the ligand by direct chemical synthesis or 
10 enzymatic synthesis or purification from a natural 
source . 



A carbohydrate ligand that can bind to a leczyme can 
be identified by several methods. In one method, a 
sample containing a carbohydrate ligand is contacted with 
15 a leczyme suspected of binding to the ligand under 
suitable conditions to allow specific binding of the 
ligand to the leczyme. Suitable conditions include, for 
example, an appropriate buffer concentration and pH and 
time and temperature that permits binding of the 
20 particular leczyme and the carbohydrate ligand. After a 
suitable reaction period, the amount of carbohydrate 
ligand bound to the leczyme can be determined, for 
example, by attaching a detectable moiety such as a 
radionuclide or a fluorescent label to the carbohydrate 
25 ligand and measuring the amount of label that is 
associated with the leczyme after any unbound 
carbohydrate ligand has been removed from the 
ligand -leczyme complex. 
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As used herein, "detectable label" means a molecule 
whose presence can be detected due to a physical, 
chemical or biological characteristic of the molecule. 
Detectable labels include, for example, radioisotopes, 
5 fluorescent molecules, enzyme/substrate systems, or 
visually detectable molecules. Methods for detectably 
labeling a carbohydrate molecule are well known in the 
art, and include, for example, reduction with NaB( 3 H) 4 or 
synthesis with radiolabeled carbohydrates (see, for 
0 example, Varki, supra, 1994 and Rothenberg et al., Proc. 
Natl ■ ftrfld , fi ri , f T jfi ft ) , 90:11939-11943 (1993), both of 
which are incorporated herein by reference, and Fukuda 
and Kobata, supra 1993) . In addition, kits for the 
preparation of a labelled carbohydrate molecule are 
> readily available from commercial sources such as Oxford 
GlycoSystems (Rosedale, NY) . 



Methods to remove unbound labelled ligand from the 
ligand-leczyme complex may depend, for example, on 

20 attaching the leczyme to a solid support. Solid supports 
useful in the present invention and methods to attach 
proteins to such supports are well known in the art (see 
for example Harlow and Lane, Antihnrii ^: a laWai-rvry 
Manual (Cold Spring Harbor Laboratory Press, Cold Spring 

25 Harbor, New York, 1988), which is incorporated herein by 
reference) . Such solid supports include, for example, 
Sepharose, agarose or polystyrene. 

After a suitable reaction period and after any 
unbound label has been removed from the support by, for 
10 example, washing, the amount of label attached to the 



WO 9*17870 PCT/US95/15913 

22 

solid support provides a direct measurement of the amount 
of carbohydrate ligand bound to the leczyme on the 
support. Alternatively, the amount of labelled 
carbohydrate ligand bound to the support can be 
5 indirectly determined after the reaction period by 
measuring the amount of unbound label and subtracting 
this from the amount of label added at the start of the 
reaction. 

To accurately determine the amount of labelled 

L0 ligand that binds specifically to the leczyme, a control 
reaction can be performed where all conditions are the 
same as in the binding reaction between the labelled 
ligand and leczyme except that the leczyme is not 
included in the control reaction or the leczyme is 

.5 replaced by an irrelevant protein such as a serum 

albumin. The control reaction determines the amount of 
binding of the labelled carbohydrate ligand that occurs 
nonspecifically such as the amount of labelled ligand 
that binds to the solid support rather than to the 

0 leczyme on the solid support. Thus, it is necessary to 
subtract the nonspecific binding value obtained from the 
control reaction from the binding value obtained from the 
reaction that included both the labelled carbohydrate 
ligand and the leczyme to determine the amount of ligand 

5 that specifically bound the leczyme under the conditions 
tested. 

An advantage of using a solid support is that the 
labelled ligand can be added in excess relative to the 
leczyme, making it possible to identify lower levels of 
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binding affinity between the carbohydrate ligand and the 
leczyme. Methods such as Scatchard analysis are well 
known in the art for determining the binding affinity 
between two molecules, both of which can be in solution 
5 or one of which can be attached to a solid support. 

Equilibrium dialysis is an example of a method where the 
binding of a ligand to leczyme can be determined when 
both molecules are in solution. 

Methods to measure the binding of a labelled 
10 carbohydrate ligand to a leczyme also can be performed 
when the leczyme is associated with a cell. In a manner 
analogous to the use of solid supports, cells that 
express the leczyme on the cell -surface can bind the 
labelled carbohydrate ligand and, after a suitable 

15 reaction period, the cells can be separated from the 

unbound ligand by methods well known in the art such as 
by centrifugation or filtration. Cells that express a 
leczyme in the cytoplasm can also be used to detect 
binding of a carbohydrate ligand to the leczyme provided 

20 the cell membrane has been sufficiently permeabilized to 
allow access of the carbohydrate ligand to the leczyme in 
the cell. Methods that use cells in binding assays such 
as antigen-antibody binding assays are well known in the 
art (see, for example, Harlow and Lane supra, 1988) and 

25 are generally applicable to binding assays between a 
carbohydrate ligand and a leczyme. 

A leczyme -expressing cell can be a cell that 
naturally expresses the leczyme, such as a lymphocyte 
that expresses a class I or class II MHC encoded leczyme. 



10 
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A leczyme- expressing cell can also be a cell that 
expresses the leczyme as a result of introducing an 
expression vector encoding the leczyme into the cell. 
Leczyme-expressing cells can be obtained from in vivo 
5 sources by methods well known in the art such as 

mechanical disruption of tissue or digestion of tissue by 
enzymes to release cells from their surrounding matrix 
(see for example. Freshney Culture of mj ma1 ro1 1 „ (Alan 
R. Liss, New York, 1993), which is incorporated herein by 
reference) . A leczyme-expressing cell can be a cell line 
that is available from public cell repositories such as 
from the American Type Culture Collection. 

It is well known in the art that the binding between 
two molecules can be performed when either of the two 

15 molecules contains a detectable label. Thus, the 
identification of a detectably labelled carbohydrate 
ligand that binds to a leczyme attached to a solid 
support or a cell also can be performed if the leczyme 
contains the detectable label and the carbohydrate ligand 

20 is attached to a solid support or expressed by a cell. A 
leczyme can be detectably labelled using methods for 
labeling a protein, which are well know in the art and 
include, for example, biotinylation or incorporation of 
radioisotopic labelled precursors. A carbohydrate 
25 ligand-expressing cell can be a cell obtained from 

tissues or organs or can be a cell line such as a cell 
line available from a public repository. 



Methods for attaching a carbohydrate ligand to a 
solid support depend on the chemical nature of the 
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ligand. Thus, attachment can be accomplished through the 
carbohydrate moiety or other molecule bonded to the 
carbohydrate ligand attachment via chemistry suitable for 
attaching carbohydrate, peptide or lipid structures to a 
5 solid support. Methods to attach carbohydrates, proteins 
or lipids to various types of solid supports are well 
known in the art. 

The binding of a carbohydrate ligand to a leczyme 
can be determined without the need for a detectable label 

10 by measuring a physical characteristic of the either the 
ligand or the leczyme such as absorption of ultraviolet 
radiation. Such methods for quantitating a protein or 
carbohydrate by physical characteristics are well known 
in the art. The ability to follow a physical 

15 characteristic of the ligand or leczyme -can be applied to 
binding assays that use a solid support or an expressing 
cell or when both molecules are in solution. The binding 
of a carbohydrate ligand to a leczyme also can be 
evaluated if the ligand is a substrate for the enzymatic 

20 activity of the leczyme. In this case, binding can be 
measured by following substrate conversion kinetics 
measured, for example, by the Michealis-Menten equation 
(Devlin, Textbook of Bingh^mi ^17 (Wiley-Liss Inc. New 
York, 1992) , which is incorporated herein by reference) . 

25 Methods for identifying a carbohydrate ligand that 

binds a leczyme can be performed using a single purified 
carbohydrate ligand or a limited number of carbohydrate 
ligands, which can be purified by conventional procedures 
as described above or can be purified by binding to a 
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binding reagent. As used herein, M binding reagent" 
means a chemical or biological molecule that can 
specifically bind to a carbohydrate ligand. For example, 
a leczyme that binds to a carbohydrate ligand is a 
5 reagent that can be used to purify that ligand. Also, an 
antibody can be a reagent if it can react specifically 
with the carbohydrate, protein or lipid portion of a 
carbohydrate ligand. 



A purified carbohydrate ligand can also be 
10 detectably labelled by methods disclosed herein. A 
carbohydrate ligand that is not purified, such as one 
that is in a sample containing other molecules, can be 
used in a binding assay provided it is attached to a 
solid support or is expressed by a cell and binding is 
15 determined by detecting binding of a leczyme. In this 
case, if the non-purified carbohydrate ligand can bind 
the leczyme, the sample containing the ligand can be 
subjected to purification and subsequent binding assays 
to obtain the carbohydrate ligand in a purified state. 

20 

Purified leczymes can be obtained from cells by 
classical methods for protein or glycoprotein 
purification such as methods known in the art for 
purifying class I or class II molecules. Leczymes also 

25 can be obtained from cells that have been modified by 

molecular biological techniques to enable expression of a 
leczyme. A gene encoding a leczyme can be cloned into an 
expression vector and then introduced into a host cell. 
Vectors are well known in the art and include, for 

30 example, cloning vectors and expression vectors, as well 
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as plasmids or viral vectors (see, for example, Goedell, 
Methods in EnT i vmology , vol. 185 {Academic Press, New 
York, 1990) , which is incorporated herein by reference) . 
A baculovirus vector is an example of a vector that can 
5 be used to express a leczyme in insect cells and result 
in expression of new carbohydrate ligands on the cell. 

A vector comprising a nucleic acid molecule encoding 
a leczyme also can contain a promoter or enhancer 
element, which can be constitutive or inducible and, if 

10 desired, can be tissue specific. Host cells also are 
known in the art and an appropriate host cell can be 
selected for the particular vector to be used. For 
example, a baculovirus transfer vector can be used with 
baculovirus DNA to infect insect cell lines such as SF21 

15 cells. Cloning of such transformed cells to produce a 
stable cell line can provide a source of the expressed 
leczyme or can provide a source of carbohydrate ligand 
modified by the expressed leczyme. 

The gene encoding a leczyme can be expressed as a 
20 fusion protein to assist in purification or in further 
downstream processing of the leczyme. For example, the 
leczyme can be produced as a chimeric protein fused to 
the CH2 or CH3 domain that constitutes the Fc binding 
region of an immunoglobulin molecule, as was performed 
25 previously for expressing the CD22fi lectin (Stamenkovic 
et al. 66:1133-1144 (1991)). The use of Protein A 

from Staphylococcus aureus bound to a solid support, 
which is readily available from commercial sources, can 
be used to purify the Fc containing chimeric leczyme. In 
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addition, the solid support containing the chimeric 
leczyme can be used directly to evaluate binding of a 
carbohydrate ligand. 



The methods that have been described above for 
5 identifying a carbohydrate ligand that binds to a leczyme 
can also be used to identify a leczyme that binds to a 
carbohydrate ligand. For example, a sample containing a 
leczyme is contacted with a carbohydrate ligand suspected 
of binding to the leczyme under suitable conditions to 

10 allow specific binding of the ligand to the leczyme. 
Leczymes to be identified for binding include, for 
example, a purified leczyme or a leczyme contained within 
a complex mixture such as a mixture of proteins expressed 
from a cDNA expression library. Methods to produce a 

15 cDNA expression library are well known in the art (see, 
for example, Sambrook et al, Molecular Cloning r A 
laboratory manual (Cold Spring Harbor Laboratory Press 
1989) , which is incorporated herein by reference) . 



A carbohydrate ligand can also be purified by an 
20 affinity column and related technologies. For example, a 
sample containing the carbohydrate ligand can be 
contacted with the binding reagent under suitable 
conditions to allow formation of a ligand-binding reagent 
complex. Suitable conditions includes, for example, an 
25 appropriate buffer concentration and pH and time and 
temperature that permits binding of the carbohydrate 
ligand to the binding reagent. The ligand-binding 
reagent complex is then separated from the rest of the 
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sample by a separation method such as by washing, and the 
ligand is dissociated from the complex. 

As another example, a leczyme that binds to a 
carbohydrate ligand may comprise a binding reagent used 
5 to purify that ligand. An antibody can also comprise a 
binding reagent if it can react specifically with the 
carbohydrate, protein or lipid portion of a carbohydrate 
ligand. 



Purification of the carbohydrate ligand can be 
10 accomplished if the binding reagent is attached to a 
solid support such as agarose, Sepharose or plastic. 
Methods for coupling a protein or a carbohydrate to a 
solid support, disclosed above for detecting the binding 
of a carbohydrate ligand to a leczyme, also are useful 
15 for attaching a binding reagent to a solid support. 



Methods to dissociate a carbohydrate ligand from a 
ligand -binding reagent complex can depend on the nature 
of the binding reagent. For example, if the binding 
reagent is a leczyme, then a method for dissociating the 

0 complex can involve competitive inhibition of the 

complex with a carbohydrate structure that has binding 
affinity for the same site in the leczyme that binds the 
carbohydrate ligand. Other well known treatments that 
are useful for dissociating a carbohydrate ligand from a 

5 binding reagent include, for example, extremes in pH, 
high salt concentration or chaotropic agents (see, for 
example, Harlow and Lane, supra, 1988), which is 
incorporated herein by reference and Varki, supra, 1994). 
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Carbohydrate ligands purified by the above disclosed 
methods are suitable for structural analysis as described 
above, in order to enable future production of the ligand 
by chemical or enzymatic synthesis. 

5 An antibody that specifically binds to an 

oligosaccharide or polysaccharide -type carbohydrate 
ligand can be produced to the carbohydrate, or to a 
protein moiety or a lipid moiety if such moieties are 
bonded to the ligand. An antibody specific for the 

10 peptide backbone of carbohydrate ligand such as the 

peptide backbone of a mucin can be useful for purifying a 
source of mucin from different cells or from different 
individuals, since the peptide backbone can be more 
conserved between peptide containing carbohydrates than 

15 the carbohydrate portions of these molecules. Methods 
for producing antibodies such as polyclonal antibodies, 
monoclonal antibodies, antibody fragments or the like, 
that are specific for protein, carbohydrate or lipid are 
well known in the art (see, for example, Harlow and Lane 

20 supra, 1988) . 

A leczyme can be purified using methods whereby the 
leczyme specifically binds to a carbohydrate ligand. In 
these methods, a sample containing the leczyme is 
contacted with a carbohydrate ligand under suitable 
25 conditions to allow formation of a ligand-leczyme 

complex. Suitable conditions includes, for example, an 
appropriate buffer concentration and pH and time and 
temperature that permits binding of the leczyme and the 
carbohydrate. The ligand-binding reagent complex is then 
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separated from the rest of the sample by a method such as 
by washing, and the leczyme is dissociated from th 
complex . 



Purification of the leczyme can be accomplished if 
5 the carbohydrate ligand is attached to a solid support 
such as agarose, Sepharose or plastic. Methods for 
coupling a carbohydrate ligand to a solid support, such 
as those disclosed above for detecting the binding of a 
carbohydrate ligand to a leczyme, are useful for 
10 attaching a carbohydrate ligand to a solid support. 
Methods for dissociating the leczyme from the 
ligand-leczyme complex can utilize the methods disclosed 
herein for dissociating a carbohydrate ligand from a 
ligand-leczyme complex. 

15 A carbohydrate ligand that modifies the function of 

a leczyme-expressing cell can be identified by contacting 
a sample containing a carbohydrate ligand with the cell 
under suitable conditions, which allow specific binding 
of the ligand to the leczyme on the cell. After a 

20 suitable period of time to allow for binding of the 
ligand to the leczyme, the cells are evaluated to 
determine their function. A carbohydrate ligand that 
modifies the function of a leczyme-expressing cell is one 
that when contacted with the cell results in a function 

25 that differs from the function of the same type of cell 
that had not contacted the ligand. 

As used herein, ■' function" in reference to a cell 
includes any activity that can be detected for a cell. 
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The function of a cell can vary with the nature of the 
cell in question. For example, the function of a T 
lymphocyte can include activities such as the production 
of certain cytokines, acquisition of cell mediated 
5 lympholysis, ability to mediate antibody dependent cell 
mediated cytotoxicity or the ability to help B 
lymphocytes produce antibody. Thus, a particular 
carbohydrate ligand that can bind to a leczyme on a T 
lymphocyte and subsequently effect the function of the 
10 cell can do so by increasing or decreasing any of the 
above T lymphocyte functions. As another example, 
addition of a complex carbohydrate to the epithelial 
cells of the gastrointestinal tract will affect iron 
absorption. 



15 Contacting a carbohydrate ligand with a 

leczyme-expressing cell can be performed in vitro in a 
cell culture medium. Methods for measuring the function 
of lymphoid cells or other cells are well known in the 
art (see for example, Colligan et al., Curr. Protocols in 

20 Immunol . (Greene Publishing Associates and Wiley 
Interscience, New York, 1992) ; Mishell and Shiigi, 
Selected Meth, in Cell. Immunol . (w. H. Freeman and Co., 
New York, 1980), each of which are incorporated herein by 
reference) . 

25 Methods described above for identifying a 

carbohydrate ligand that modifies the function of a 
leczyme-expressing cell are also useful for identifying a 
leczyme that modifies the function of a carbohydrate 
ligand-expressing cell. 
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compared to the amount of carbohydrate ligand that bound 
to leczyme in a control sample that did not contain 
peptide. If the amount of carbohydrate ligand that bound 
to the leczyme in the sample containing peptide is less 
5 than the amount of carbohydrate ligand that bound to the 
leczyme in the control sample, then it can be concluded 
that the peptide had bound to the carbohydrate ligand 
binding site of the leczyme and is therefore a peptide 
mimetope of the carbohydrate ligand. 

A peptide mimetope can be identified in an assay 
format that utilizes a carbohydrate ligand containing a 
detectable label and a leczyme that is bound to a solid 
support or is expressed by a cell. Methods disclosed 
herein for identifying a carbohydrate ligand that bind to 
a leczyme are useful to generate the assay .format for 
identifying a peptide mimetope of a carbohydrate ligand. 

A defined peptide sequence can be chemically 
synthesized or produced by biological methods, such as by 
recombinant DNA techniques (see, for example, Sambrook et 
20 al., supra, 1989). A complex mixture of peptides also 

can be used to identify a peptide mimetope. Such complex 
mixtures can include, for example, a mixture of defined 
sequences, or can be a semi-random or random library of 
sequences. Methods to generate peptide libraries by such 
25 methods as chemical synthesis on a bead or a microtiter 
plate or biological production such as on the surface of 
a bacteriophage are well known in the art (see, for 
example, Huse et al., Science 246:1275-1281 (1989), which 
is incorporated herein by reference) . 
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A peptide that can bind to the carbohydrate ligand 
binding site of a leczyme can also have some of the 
functional characteristics of a carbohydrate ligand and 
thus be considered a functional mimetope of the 
5 carbohydrate ligand. Such peptide mimetopes can be used 
to modify the function of a cell and also can be used to 
treat a disease state that involves a leczyme that can 
bind to the mimetope . 

A cell can be modified to produce a carbohydrate 
L0 ligand by introducing an expression vector encoding a 
leczyme into a cell to obtain expression of the leczyme, 
which results in production of the carbohydrate ligand by 
the cell. Cells producing a particular carbohydrate 
ligand are useful to provide unique types of ligands, 
5 which can be purified from the cells. In addition, such 
cells are useful in binding assays to identify a leczyme 
that binds the ligand. 

An immune response in an individual, such as a human 
or other animal, can be modulated using an antigen for 

0 which the immune response is desired and a carbohydrate 
ligand that binds to a leczyme. As leczymes include, for 
example, the major histocompatibility complex molecules 
that are involved in the immune system, injection of a 
carbohydrate ligand and an antigen can modulate an immune 

» response. As used herein, "modulate" means increase or 
decrease. An increase in the immune response can be 
obtained by administering a carbohydrate ligand bound to 
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antigen such that the antigen is targeted via the leczyme 
to an antigen presenting cell. 



An antigen can be associated with a carbohydrate 
ligand by covalently bonding the antigen to carbohydrate 
5 or to any protein or lipid of the ligand using methods 
well known in the art. The actual method to covalently 
couple the antigen to the carbohydrate ligand will depend 
on the nature of each molecule to be coupled and whether 
the coupling procedure is detrimental to the any critical 
10 antigenic determinants of the antigen or the capability 
of the carbohydrate ligand to bind its' target leczyme. 
Such detrimental effects can be readily evaluated in 
binding assays as described above. 



More than a single antigen molecule or more than a 
15 single carbohydrate ligand can be coupled together to 
produce an immunogen. Such molecules can be made 
multivalent for either or both of the antigen or the 
carbohydrate ligand and can be used for eliciting a 
greater immune response than an immunogen containing a 
20 single molecule of antigen and a single molecule of a 
carbohydrate ligand. 



Methods to increase an immune response in an 
individual are well known to those in the art and require 
optimization of parameters such as dose, route of 
25 administration, use of an adjuvant, or schedule of 
administration (see, for example, Harlow and Lane, 
chapter 5, supra, 1988) . These same methods can be 
adapted for use with carbohydrate ligands. An increased 
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immune response obtained after administering an antigen 
and a carbohydrate ligand is achieved when the immune 
response parameter has increased by a statistically 
significant level over the level of the parameter 
5 manifested prior to administration of the antigen and 
carbohydrate ligand. 

The immune response parameters that can increase 
after administering an antigen associated with a 
carbohydrate ligand include an antibody-mediated response 

10 or a cellular-mediated response. Methods to measure 

antibody immune responses are well known to those in the 
art and include, for example, detection of 
immunoglobulins by both in vitro and in vivo methods (see 
for example Harlow and Lane, supra, 1988). Methods to 

15 measure cellular immune responses are al*so well known in 
the art and include in vivo methods such as skin testing 
for delayed hypersensitivity and in vitro methods such as 
direct cell cytotoxicity or cell activation assays (see, 
for example, Colligan et al . supra, 1992; Mishell and 
20 Shiigi, supra, 1980) . 



An antigen associated with a carbohydrate ligand can 
also be used to decrease an immune response to the 
antigen and can be particularly useful for treating a 
deleterious immune response such as an autoimmune disease 
state. Methods for decreasing an immune response can, 
under some conditions result in a prolonged state of 
specific immunological unresponsiveness to the antigen, 
commonly referred to as a state of tolerance to the 
antigen. 
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Decreasing an immune response to an antigen by 
administering the antigen bonded to a carbohydrate ligand 
can be accomplished using methods well known in the art 
to suppress or tolerize an individual to an antigen. 
5 Such methods include, for example, administration of low 
doses, monomeric and nonaggregated forms of the antigen 
and carbohydrate ligand or administration orally. In 
addition, a decreased immune response can be obtained by 
administering the antigen and carbohydrate ligand 
10 concurrently with an immunosuppressive agent such as 
cyclosporin A, FK506 or antibodies to a particular T 
lymphocyte cell -surface receptor. Methods for using such 
agents to decrease the immune response to an antigen in 
humans or animals are well known in the art. 

15 A disease state involving a leczyme can be treated 

by administering an effective amount of a carbohydrate 
ligand that binds to the leczyme. As used herein, the 
term "disease state" includes any diseases, whether 
genetic or acquired, provided a leczyme plays a role in 

20 the disease process. Disease states which may be 
ameliorated using methods described herein include 
inflammation, transplantation rejection, and may also 
includes diseases having both a genetic and an 
environmental basis such as iron storage diseases, 

25 autoimmunity or cancer. In addition, other disease 

states which may be ameliorated using methods described 
herein include those resulting from an infectious agent 
such as a virus, bacteria, yeast or parasite. The 
ability of an infectious agent to enter and infect cells 

3 0 of the host can occur by binding to the leczyme or 



WO 96/17870 PCT/US95/15913 

39 

carbohydrate ligand expressed on the cells of the host. 
A peptide mimetope for a carbohydrate can also be used to 
treat a disease state that involves a leczyme for which 
the mimetope can bind. 

5 A disease state involving a leczyme can be treated 

by administering a leczyme having a similar binding 
specificity for a carbohydrate ligand as the leczyme 
involved in the disease state. The disease states useful 
for treatment by a leczyme include those described above 
10 for treatment by a carbohydrate ligand. Thus, aberrant 
self-recognition, mediated by a leczyme in a diseased 
individual, can be treated by administration of a 
leczyme. Such a leczyme can bind to the natural 
carbohydrate ligand detected on a target cell by the 
15 aberrant self-reactive leczyme -expressing cell, and, 
therefore, block the ability of the self -reactive 
leczyme-expressing cell to recognize and react aberrantly 
towards the target cell. 

An iron metabolic disorder known as hemochromatosis 
20 can also be treated using methods disclosed herein 

because defects in iron metabolism can have a basis in 
leczyme function. In elevated concentrations, iron is a 
toxic inorganic molecule that has been implicated in the 
pathophysiology of a number of common diseases. These 
25 include but are not limited to cancer (Stevens et al, IL 
fi nq l. J . MHl , 319:1047 (1988); Stevens, et al . , Mgd^ 
Qnco] . Tumor Ph a ifnq ror h Pr 7:177-181 (1990)), heart 
disease (Kannel, et al, 1976; Sullivan, Lancet . 
1:1293-1294 (1981); Salonen, et al, Circulation . 
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86:803-811 (1992)), reperfusion injury (Zweier, J. Biol. 
Chem . , 263:1353-1357 (1988)) and rheumatoid arthritis 
(Blake et al.. Arthritis Rhmim. . 27:495-501 (1984)). 
There is no argument that severe iron overload results in 
5 a constellation of pathologies, collectively called 

hemochromatosis, the most common genetic disease known to 
affect humans. 

Hemochromatosis results from enhanced absorption of 
iron from the GI tract by active transport but the gene 
or genes causing the underlying metabolic defect is 
currently unknown. Identification of the genes 
responsible for the absorption of iron, and developing an 
animal model in which iron overload is due to active 
enhanced absorption of iron from the GI tract, would 
greatly facilitate understanding hemochromatosis and 
increase knowledge about the general mechanisms of iron 
metabolism. As disclosed herein, an MHC-encoded leczyme 
may be involved in the pathogenesis of hemochromatosis. 

Hemochromatosis is not usually brought to clinical 
20 attention until symptoms develop, and several studies 
have indicated that removal of the iron after the 
development of tissue damage does not necessarily improve 
the organ function (Cundy, et al., Clin. En docrinol. . 
38:617-620 (1993); Westera et al., Am. J. Clin. Path. . 
25 99:39-44 (1993)). Hemochromatosis is an underdiagnosed 
and undertreated disease that would benefit greatly from 
early diagnosis and an effective treatment (for reviews 
see Edwards et al., Hosp. Pra ct. Supp l . . 3:30-36 (1991); 
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Edwards and Kushner, N . Enal .7 ^ 328:1616-1620 
(1993) ) . 



Untreated hemochromatosis is characterized by iron 
overload of parenchymal cells, which is toxic and in many 
5 cases a probable cause of various complications including 
hepatopathy (including cirrhosis, and liver cancer), 
arthropathy, hypogonadotropic hypogonadism, marrow 
aplasia, skin disorders, diabetes mellitus, and 
cardiomyopathy (for review see Halliday and Powell, iron 

10 ant * Human Dinftfmp , Lauffer, rb, (ed) . 131-160 (1992)). 

There are reportedly 1.5 to 2 million active cases of 
hemochromatosis within the U.S., with approximately 25% 
of late diagnosed or untreated patients developing 
hepatomas . 



15 



In untreated hemochromatosis, iron is universally 
deposited in the hepatocytes of the liver, and elevated 
saturation of transferrin with elevated serum ferritin 
levels combined with liver biopsy provides the best 
diagnostic test currently available (Fairbanks, Hoap. 
20 Pract . , 26:17-24 (1991)). The iron is found primarily in 
the cytoplasm of hepatocytes, and by electron microscopy 
in lysosomal vacuoles, and in more severe cases, iron is 
deposited in mitochondria (for review see Iancu, Peri. 
Pathol ■ ' 10:281-296 (1990)). Other liver toxins such as 
25 alcohol and hepatitis exacerbate the damage caused by the 
iron deposition (Piperno et al., J. Hepaf. . 16:364-368 
(1992) ) . Patients with hemochromatosis are advised not 
to drink alcohol, because of increased liver damage, or 
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to smoke tobacco products, as iron deposition can also 
occur in the lungs. 



Hemochromatosis is reported in the literature as an 
autosomal recessive disease in which the responsible 
5 gene(s) is linked to the A locus of the human MHC (HLA 
complex) , located on human chromosome 6 (Simon and 
Brissot, Hepatol . . 6:116-124 (1988)). Linkage to human 
HLA-A3 has been documented in approximately 73% of cases. 
However, other genetic loci also have been implicated, 
10 especially in African (Gorduke et al., N. Engl . J. Med. . 
326:95-100 (1992)) and African-American populations 
(Barton et' al., Blood . 85:95a (1993)). 



Hemochromatosis is presently recognized as the most 
common genetic malady in humans, far exceeding cystic 

15 fibrosis, phenylketonuria and muscular dystrophy combined 
(Leggett et al., Clin. Chem. . 36:1350-1355 (1990)). One 
explanation for the high incidence of this genetic 
disease may be that mutations in multiple linked genes 
produce a similar phenotype. Hemochromatosis occurs most 

20 frequently in populations of European origin with a 
frequency in homozygotes and heterozygotes of 
approximately 0.3 and 13%, respectively. 



Several markers, including the recently described 
D6S105, have been identified in the human MHC locus and 
25 have narrowed the genomic location of the hemochromatosis 
gene to within 1 centimorgan of the A locus (Jazwinska et 
al., Am. J. Hum. Genet. . 53:347-352 (1993)), and possibly 
centromeric to HLA-F (Gasparini, et al . , Hum . Mol . 
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Genet ■ , 5;57l-576 (1993)). Others have reported 
candidate hemochromatosis genes located 20-200 kb 
telomeric to HLA-A (el Kahloun et al . , Hum. Mnl r.^ n 
2:55-60 (1993)). while several of these candidate genes 
5 were thought to be single copy, three of the genes, 
termed HCG II, IV and VII, were found to be multicopy 
genes. Thus, despite the advances made in determining 
the location of the hemochromatosis gene, it has not yet 
been isolated. 



10 



Most animal models for iron overload are not 
entirely suitable for the study of hemochromatosis since 
they do not reflect enhanced iron absorption from tha gut 
by active transport. Mice homozygous for deletion of the 
gene encoding f^M (^-/-mice (Roller et al., Science, 
15 248:1227-1230 (1990); Zijlstra et al., Nature . 

344:742-746 (1990)), however, provide an excellent animal 
model for the study of hemochromatosis. These animals 
lack detectable class I proteins on the cell-surface, 
although biochemical labeling shows that class I gene 
20 products are being synthesized. Activated lymphocytes 
from Bj-/- animals can be lysed by activated natural 
killer (NK) cells, again suggesting a deficiency in class 
I expression (Liao et al., Sslsnss., 253:199-202 (1991)). 
These mice were originally developed to study the role of 
25 SjM in development. While the mice developed and bred 
normally, they failed to generate significant numbers of 
CD8+ T cells. Consequently, these mice have been 
intensely studied from an immunologic perspective. 
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&2-/-mice combat viral infections relatively well, 
although the course of the infection is longer than in 
normal animals (Eichelberger et al., J . Exp Med _ . 
174:875-878 (1991); Muller et al . , Nature . 255:1576-1579 
5 (1992)). They reject allografts (Zijlistra et al., 

Exp. Mfifl., 175:885-889 (1992)) and show higher levels of 
Ig production and faster class switching of antibody 
types than normal mice. Although GD8+ T cells are low to 
undetectable at birth, studies have shown that the 

10 animals can generate CD8+ T cells, and a cytotoxic CD8+ T 
cell response can be mounted under appropriate 
circumstances (Apasov and Stikovsky, J . Immunol . . 
152:2087-2097 (1994)). Another significant abnormality 
reported in these animals is that they develop 

15 hyperglycemia (glucose>300 mg/dl) in old age (greater 
than 2 years) . It has been suggested that the onset of 
diabetes in the Sa-/- mice is related to autoimmunity 
(Faustman et al . , Science, 254:1756-1761 (1991)), however 
this explanation has been disputed (Serreze et al., 

20 Diabftr.fifi, 43:505-509 (1994); Wicker et al . , Diabetes . 
43 :500-504 (1994) ) . 



S 2 -/-mice can develop iron overload that is similar 
to human hemochromatosis. £ 2 -/-mice can spontaneously 
develop hepatomas. This observation combined with the 

25 molecular biology data of the S-GAP genes (see Example 
I) , suggested that the mice would develop iron overload. 
Histochemical examination of tissues from these mice, 
confirmed this hypothesis. Iron was found deposited in 
the liver of all animals, and in the kidneys, spleen and 

30 lungs of some of the animals. In addition, 16% of the 
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animals developed liver disease, having either hepatomas 
or liver necrosis. Thus, the clinical findings for the 
fij-/- deficient mice are sufficiently similar to the 
pathology of hemochromatosis to make the -mouse an 
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organ- specific autoimmune diseases such as Addison's 
disease, Goodpasture's syndrome, Grave's disease, 
Hashimoto's thyroiditis, idiopathic thrombocytopenia 
purpura, myasthenia gravis or pernicious anemia. As 
5 hemochromatosis in humans is likely mediated by a fi-GAP 
promoter-driven leczyme, then treatment with a 
carbohydrate ligand, leczyme or competing molecule with 
the same or similar binding specificity as the leczyme 
involved in the disease may be used to modulate the 
10 disease process. A carbohydrate ligand that binds the 
nonclassical class I leczyme involved in hemochromatosis 
can be administered to inhibit binding to the fi-GAP iron 
carrier. 

A process to follow for using a carbohydrate ligand 
15 to treat a disease such as autoimmunity can first require 
identification of the leczyme that is involved in the 
disease process. Subsequently, a candidate carbohydrate 
ligand that can bind to the leczyme is identified by 
methods disclosed herein. Thus, such candidate 
carbohydrate ligands can then be tested in vitro to 
identify those efficient at blocking the autoimmune 
reaction exhibited when the leczyme on autoreactive 
immune cells from the diseased individual recognizes a 
carbohydrate molecule expressed on the cells of the 
individual that is the target of the autoreactive cell. 
The autoimmune reaction can be measured by' an increase in 
a cell function such as cell proliferation or release of 
cytokines (see for example, Colligan et al . supra, 1992; 
Mishell and Shiigi, supra, 1980) . The best candidate 



WO 96/17870 PCT/US95/15913 

47 

carbohydrate ligands can then be used as a medicament to 
treat the disease . 



The methods disclosed herein for the treatment of 
hemochromatosis are also suitable for the treatment of 
5 many other medical diseases or complication resulting 
from iron overload. Since multiple leczyme genes are 
involved in mediating control of iron metabolism, the 
type of mutation, its location in the gene and the number 
and type of leczyme genes mutated in an individual are 
10 factors that can effect the extent of iron overload in an 
individual. As the extent of iron overload exhibited by 
an individual is dependent on the above factors, then the 
methods disclosed herein to treat hemochromatosis are 
also applicable for treating other diseases resulting 
15 from iron overload. Such diseases include, for example, 
hepatopathy (including cirrhosis, and liver cancer), 
arthropathy, hypogonadotropic hypogonadism, marrow 
aplasia, skin disorders, diabetes mellitus, and 
cardiomyopathy (for review see Halliday and Powell, iron 
20 and Human Disfianp, Lauffer, rb, (ed) . i3i-ieo {1992)). 

In order to modulate hemochromatosis or other iron 
storage disease, the carbohydrate ligand or mimetope is 
administered in an effective amount. The total 

25 effective amount can be administered to a subject as a 
single dose, either as a bolus or by infusion over a 
relatively short period of time, or can be administered 
using a fractionated treatment protocol, in which the 
multiple doses are administered over a more prolonged 

30 period of time. One skilled in the art would know that 



WO 96/17870 PCT/US95/15913 

48 

the concentration of a carbohydrate ligand required to 
obtain an effective dose in a subject depends on many 
factors including the age and general health of the 
subject as well as the route of administration and the 
5 number of treatments to be administered and the chemical 
form of the carbohydrate ligand. In view of these 
factors, the skilled artisan would adjust the particular 
amount so as to obtain an effective amount for the 
subject being treated. 

0 A carbohydrate ligand or a leczyme can also be 

useful in vivo for the treatment of autoimmune diseases 
involving a leczyme. In autoimmune disease, a leczyme 
expressed on a lymphoid cell can recognize a 
self -carbohydrate ligand as a foreign carbohydrate 

5 ligand, resulting in immune -directed destruction of cells 
expressing the self -carbohydrate ligand. Thus, 
administration of a carbohydrate ligand, mimetope or 
other competing molecule that can bind to the leczyme 
involved in aberrant self -recognition can block lymphoid 

► cell recognition or activation leading to a reduction in 
symptoms or cessation of autoimmune disease. 
Alternatively, administration of a leczyme that has the 
same or similar binding specificity for the 
self-carbohydrate ligand recognized by a leczyme of the 
autoreactive lymphoid cell can also be used to treat the 
autoimmune disease. 

A carbohydrate ligand or a leczyme can be used to 
treat a disease state resulting from an infectious agent 
such as a virus, bacterium, yeast or parasite. 
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Infectious agents have evolved to express their own 
external receptors that can recognize carbohydrate 
structures or leczymes on the cell-surface, enabling 
entry of the agent into the cell to be infected. Thus, 
5 administration of an appropriate carbohydrate ligand or a 
leczyme to an individual exposed to an infectious agent 
can block the binding of the agent to target cells, 
subsequently inhibiting the extent of infection and 
thereby reducing the spread of the disease. 



10 



A carbohydrate ligand or a leczyme of the present 
invention can be used to treat transplantation rejection. 
Since rejection is based on the recognition of foreign 
molecules by lymphocytes of the transplant recipient, 
15 then treatment with a carbohydrate ligand that can bind 
to the leczyme of the transplant recipient's lymphocyte 
that is involved in foreign antigen recognition can 
inhibit recognition leading to transplantation rejection. 
Also, administration of a leczyme that has the same or 
20 similar binding site specificity as the leczyme of a 
transplant recipient's lymphocyte involved in foreign 
antigen recognition can inhibit recognition leading to 
transplantation rejection. 

A carbohydrate ligand or leczyme of the present 
25 invention is particularly useful when administered as a 
pharmaceutical composition containing a pharmaceutically 
acceptable carrier. Pharmaceutically acceptable carriers 
are well known in the art and include, for example, 
aqueous solutions such as a physiologically buffered 
30 saline or other solvents or vehicles such as glycols, 
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glycerol, oils such as olive oil or injectable organic 
esters . 



A pharmaceutically acceptable carrier can contain 
physiologically acceptable compounds that act, for 
5 example, to stabilize or to increase the absorption of a 
carbohydrate ligand or leczyme. Such physiologically 
acceptable compounds include, for example, carbohydrates, 
such as glucose, sucrose, dextrans, antioxidants, such as 
ascorbic acid or glutathione, chelating agents, low 

10 molecular weight proteins or other stabilizers or 

excipients. One skilled in the art would know that the 
choice of a pharmaceutically acceptable carrier, 
including a physiologically acceptable compound, depends, 
for example, on the route of administration of the 

15 composition. 

One skilled in the art would know that a 
pharmaceutical composition containing a carbohydrate 
ligand or leczyme can be administered to a subject by 
various routes including, for example, by direct 

20 instillation, orally or parenterally, such as 

intravenously, intramuscularly, subcutaneously or 
intraperitoneally. The composition can be administered 
by injection or by intubation. The pharmaceutical 
composition also can be incorporated, if desired, into 

25 liposomes or microspheres or can be microencapsulated in 
other polymer matrices (Gregoriadis, Liposome Technology. 
Vol. 1 (CRC Press, Boca Raton, FL, 1984), which is 
incorporated herein by reference) . Liposomes, for 
example, which consist of phospholipids or other lipids, 

30 are nontoxic, physiologically acceptable and 
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metabolizable carriers that are relatively easy to make 
and administer. 



An expression vector encoding a leczyme can be 
administered in vivo to treat a disease state resulting 
5 from a leczyme. For example, a disease state resulting 
from a mutated leczyme, such as anemia, can be treated by 
administering an expression vector encoding a functional 
leczyme involved in iron transport and obtaining 
expression of the vector in cells of the digestive tract. 

10 The level of expression of a particular leczyme in a 

cell can have a impact on the nature of a carbohydrate 
ligand expressed by the cell, if expression of a 
particular carbohydrate ligand is involved in a disease 
process, the ligand can be eliminated from a cell by 
15 reducing the expression of the leczyme responsible for 
producing the ligand. Thus, an expression vector can 
contain an exogenous nucleic acid molecule encoding an 
antisense nucleotide sequence that is complementary to a 
nucleotide sequence encoding a portion of a leczyme such 
20 that when introduced into a cell under the appropriate 
conditions, the expression vector can produce an 
antisense nucleic acid molecule, which can selectively 
hybridize to the leczyme gene or message in a cell and, 
thereby, affect the expression of the leczyme in the 
25 cell. For example, the antisense nucleic acid molecule 
can hybridize to a leczyme gene in the cell and can 
reduce or inhibit transcription of the leczyme gene. 
Also, the antisense molecule can hybridize to the message 
encoding the leczyme in the cell and can reduce or 
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inhibit translation, processing and cell stability or 
half-life of the RNA. 



Expression vectors also can be used to effect the 
expression of a leczyme or of a carbohydrate ligand 
5 involved in a disease state by introducing into a cell an 
exogenous nucleic acid molecule encoding a ribozyme that 
can specifically cleave RNA encoding the leczyme or 
peptide backbone of a carbohydrate ligand. Thus, by 
introducing the ribozyme into cells involved in a disease 

10 process, one can reduce expression of the leczyme or 

carbohydrate ligand involved in the disease and therefore 
reduce or inhibit the disease process. An antisense 
nucleic acid molecule or a ribozyme can be chemically 
synthesized and incorporated into an expression vector 

15 using recombinant DNA techniques. The antisense nucleic 
acid molecule or ribozyme also can be added directly to a 
cell without having been incorporated into an expression 
vector . 



Methods for introducing an expression vector into 
20 cell are well known in the art. Such methods are 

described in Sambrook et al, supra, 1989; Kriegler M. 

Gene Transfer and Expression: A Laboratory Manual (W. H. 

Freeman and Co. New York NY (1990), both of which are 

incorporated herein by reference) and, for example, 
25 include transfection methods such as calcium phosphate, 

elect roporat ion, lipofection, or viral infection. 



Recombinant viral vectors are available for 
introducing an exogenous nucleic acid molecule into a 
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mammalian cell and include, for example, adenovirus, 
herpesvirus and retrovirus -derived vectors. For example, 
a viral vector encoding a leczyme can be packaged into a 
virus to enable delivery of the genetic information and 
5 expression of these leczyme in gastrointestinal 

epithelial cells following infection by the virus. Also, 
a recombinant virus which contains an antisense sequence' 
or a ribozyme specific for a nucleotide sequence encoding 
a leczyme can introduced into a cell in an individual to 
10 inhibit a disease state mediated by the leczyme or a 

leczyme with a similar carbohydrate binding specificity. 

Recombinant viral infection can be more selective 
than direct DNA delivery due to the natural ability of a 
virus to infect only certain types of cells. This 
15 natural ability for selective viral infection can be 
exploited to limit infection to only certain cell types 
within a mixed cell population. For example, 
adenoviruses can be used to restrict viral infection 
principally to cells of epithelial origin. In addition, 
20 a retrovirus can be modified by recombinant DNA 

techniques to enable expression of a unique receptor or 
ligand that provides further specificity to viral gene 
delivery. Retroviral delivery systems that provide high 
infection rates, stable genetic integration and high 
25 levels of exogenous gene expression are well known in the 
art. 



As described above, recombinant viral delivery 
systems exist that provide the means to deliver genetic 
information into a selected type of cell. The choice of 
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viral system will depend on the desired cell type to be 
targeted, while the choice of vector will depend on the 
intended application. Recombinant viral vectors are 
readily available to those in the art and can be easily 
5 modified by one skilled in the art using standard 

recombinant DNA methods (see, for example, Krieger, Gene 
Transfer and Expression: A Laboratory Manual. (W.H. 
Freeman and Company, 1990); Goeddel, Methods in 
Enzvmoloqy, vol. las. (Academic Press, 1990); and Stoker, 
10 In Molec. Virol.. A Practical Approach (eds. Davison and 
Elliott, IRL Press, 1993), all three of which are 
incorporated herein by reference) . 

A genetic predisposition for hemochromatosis or 
other iron storage diseases based on a leczyme can be 

15 diagnosed by detecting a mutation in the heavy chain of a 
class I MHC molecule encoded for by a gene in the MHC 
locus. These methods can be used to diagnose an 
individual having the symptoms of an iron storage 
disease. A positive diagnosis of mutation in an 

20 individual's heavy chain is useful to verify the 

underlying cause of the disease and by identifying the 
particular leczyme that is mutated. The identification 
of the mutated leczyme can be used with the methods 
disclosed herein to identify a carbohydrate suitable for 

25 treating the disease. 



An individual who does not have an iron storage 
disease, but is suspected of inheriting a mutation that 
can predispose the individual to develop an iron storage 
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disease later in life can also benefit from testing for 
mutations by the methods disclosed herein. 

A mutation that is diagnostic for the disease is one 
that results in a significantly reduced affinity of the 
5 heavy chain for human l^M. For example, a mutation in a 
nonclassical class I heavy chain that results in deletion 
of a signal for phosphorylation is a mutation that is 
diagnostic for hemochromatosis since a properly 
phosphorylated heavy chain is necessary for the chain to 
10 interact with l^M. Consensus amino acid sequences that 
signal a cell to phosphorylate a serine or a threonine 
residue in a polypeptide are well known in the art. A 
mutation that is diagnostic for hemochromatosis also can 
occur in a region of the heavy chain that is near to a 
15 phosphorylation site. Such a mutation can reduce the 
ability of the heavy chain to associate with SjM if the 
phosphate group added to this site cannot be removed in a 
cell. 



Methods to detect a phosphorylation site mutation in 
20 a nonclassical class I heavy chain can be based either on 
analysis of the protein or the nucleic acid encoding the 
protein. For protein determination, the nonclassical 
class I molecule can be purified from a source of cells 
or body fluids of an individual and the heavy chain can 
25 be isolated from l^M. Methods to purify a class I MHC 
molecule and isolate the heavy chain from S,M are well 
known in the art. The isolated heavy chain can then be 
subjected to amino acid sequencing, peptide mapping or 
other such protein analyses to determine if the sequence 
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a phosphorylation site has been mutated. Such methods 

for protein determination are well known to those in the 
art. 



A mutation in a nucleic acid sequence can be 
5 detected by various methods to analyze nucleic acids such 
as by nucleic acid sequencing, polymerase chain reaction 
or hybridization. Such methods are well known to those 
in the art (see, for example, Sambrook et al, supra, 
1989; Hames and Higgins Nucleic Acid Hybridisation : a 
10 practical approach (IRL Press, New York, 1985), both of 
which are incorporated herein by reference) . 

Methods to detect decreased binding of a mutated 
heavy chain with BjM can be used for diagnosing an iron 
storage disease such as hemochromatosis. In these 

15 methods, the heavy chain of an class I MHC molecule is 
isolated from an individual and contacted with J^M under 
conditions suitable for a non-mutated such heavy chain to 
associate with SjM. A control reaction, which contains a 
non-mutant form of the same or similar class I heavy 

20 chain to the one being tested for a mutation is performed 
in parallel. After contacting the heavy chain with i^M, 
the reaction is incubated under suitable conditions, 
including, for example, an appropriate buffer 
concentration and pH and time and temperature, which is 

25 sufficient for the control heavy chain to associate with 
SjM. The heavy chain being tested from the individual is 
considered to have a mutation diagnostic for an iron 
storage disease when the fraction of this heavy chain 
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that associates with I^M is significantly less than the 
fraction of control heavy chain that associates with J^M. 

The association of a class I heavy chain with can 
be detected, for example, by attaching one of the 
5 molecules to a solid support and attaching a detectable 
label such as a radionuclide or a fluorescent label to 
the other molecule and measuring the amount of detectable 
label that is associated with the solid support, wherein 
the amount of label detected indicates the amount of 
10 association of the heavy chain with J^M. 

The following examples are intended to illustrate 
but not limit the present invention. 



EXAMPLE T 



CLQNINfi AND EXPFFqqTOM of tup; fi _ nap r .p W 

15 This example provides an approach to identify and 

clone leczyme genes from various species of animal to 
elucidate their role in iron metabolic diseases. 

Cloning of rhf> moum» ft-r. an flpn PS 

Genomic or libraries were constructed by partial Hae 
20 III digestion of DNA from A/J and Balb/c mouse liver and 
cloning the fragments into the vector Charon 4A. The 
libraries were screened with the S15 probe, isolated from 
the H-2L* gene (Margulies et al . , Nature 295:168-170 
(1982), which is incorporated herein by reference)). S15 
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is a 3' class I MHC mouse probe and consists of 522 base 
pairs including 36 base pairs of exon 4 encoding the 
alpha-3 domain and 486 base pairs of intron (Evans et 
al., Prnr. Natl. Acad. Sci. (USA) . 79:1994-1998 (1982)). 
5 Probes were prepared by excising the insert from M13 RF 
or pUC18, purifying the fragment from disulfide 
cross -linked acrylamide gels (Hansen, Anal. Biochem. 
116:146-151 (1981)), and labeling with 32 P to a specific 
activity of >10 e cpm/ag by nick translation (Rigby et 
10 al., J. Mol. Biol. 113:237-251 (1977)). Libraries were 
screened using standard colony hybridization techniques 
(for details see Sambrook et al., supra, 1988). 



Seventeen unique a clones were isolated from the 
libraries and were subjected to restriction enzyme 

15 digestion mapping. BamHI digestion and gel. 

electrophoresis of these clones revealed five from the 
A/J strain and one from the Balb/c stain that contained a 
unique 500 base pair (bp) BamHI restriction fragment 
(BB500) . The six clones containing the unique fragment 

20 were subjected to BAMHI digestion, the BB500 fragment was 
gel purified and subcloned into M13 vector Mpl8 and mpl9 
(Yanisch-Perron et al . Gens 33:103-119 (1985)). DNA 
sequences were determined by the chain termination method 
(Sanger, et al . , Prpg. Natl, Acad, Sci, (PSA? 

25 74:5463-5467 (1977)) using 35 S- ATP. Reactions were 

analyzed on 6% urea-polyacrylamide gradient gels (Biggins 
et al., Pror. Natl. Acad. Sci. (USA) 80:3963-3965 
(1983)) . DNA sequences were assembled and analyzed using 
the University of Wisconsin Computer Group Programs 
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(Devereux, et al . . Nuclg-in acids 12:387-395 (1984)) 
run on a VAX-ii/785 computer. 

DNA sequence comparisons demonstrated that the BB500 
fragments share greater than 93% sequence homology, a 
5 region within the BB500 fragment shows 100% sequence 
homology between the a clones and has been termed S-GAP 
(globin analogous promoter) since it is a regulatory 
motif that shares sequence homology with mouse, rabbit or 
human S-globin promoters (for a detailed comparison see 
10 ahead) . There is close similarity between all six 

fragments (called S-GAPl-6) with the minor exception of 
E-GAP4 where an 8 base pair sequence AAGAGGAG, 
immediately downstream of a CCAAT element, has been 
deleted. There are other minor differences between these 
15 sequences, and the or clones they have been isolated from 
demonstrate different restriction patterns confirming 
that the various a clones contain unique sequences and 
are not a cloning artifact. Thus, the A/ J strain mouse 
contains at least five highly homologous S-GAP sequences 
20 within its genome. 

MflPP i na the ft-GAP Seoiienms Map 1-q Phromo R n me n in «-h» 
Mqusp 

Southern blotting was used to determine if the BB500 
sequence could identify genes located on chromosome 17 of 
25 the mouse. DNA from several Chinese hamster ovary (CHO) 
mouse somatic hybrid cell lines were evaluated by 
Southern blotting using the S-GAP6 BB500 probe. Genomic 
DNA was isolated from cultured cells, digested with 
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EcoRl, electrophoresed on 0.8% agarose gels and 
transferred to a nitrocellulose membrane. Hybridization 
with the BB500 probe was carried out in the presence of 
dextran sulfate under the conditions described by Meinkof 
5 and Wahl f Anal. Biochem. 138:267-284 (1985)) with a final 
wash in 0.2X sodium chloride sodium citrate buffer, pH 
7.0 (SSC) at 60°C. 

The BB500 probe hybridized with the HM27 cell line 
containing the DNA from mouse chromosomes 15 and 17 and 

10 revealed the same banding pattern as with total genomic 
BALB/c DNA. The cell line HM65 that lacks BALB/c 
chromosome 17 was devoid of hybridizable bands, 
indicating that the probe did not bind nonspecif ically to 
CHO DNA. DNA from other CHO cell lines containing mouse 

15 chromosomes other than chromosome 17 were examined by 

Southern blotting with the BB500 probe and were found to 
be negative (not shown) . These results indicate that the 
S-GAP sequences all map to chromosome 17 in the mouse. 



Mapping the fi-GAP Sequences to the Murine O/TL Complex 

20 The fact that the S-GAP sequences were isolated from 

the mouse genome provided several powerful tools to 
precisely map the location of the sequences. First, the 
murine MHC is highly characterized, particularly with 
respect to the nonclassical class I region and, secondly, 

25 congenic strains of mice exist where the position of 
genes in the MHC can be pinpointed. Congenic strains 
were originally developed by breeding strains of inbred 
mice together. Subsequent generations of chromosomal 
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crossing over has produced a number of strains which 
contain a portion of the MHC from one strain and the 
remainder of the MHC from another strain. Consequently, 
it is possible to compare restriction fragment length 
5 polymorphism (RFLP) between the strains, and determine if 
the banding patterns are linked to a given MHC locus (for 
review see Klein, Nature m B h flrv n f ^ M n irr 

Hl BtQcnmpflribnirv Comply , 50-73 usee)), rflp analysis 

was performed by obtaining purified genomic DNA from the 
10 various mouse strains, digesting the DNA with EcoRI and 
performing Southern hybridization with the G-GAP6 BB500 
probe as described above. The Southern blot showed that 
the probe identified up to ten different bands from the 
DNA of the mouse strains tested (Table 1) . Four of these 
15 bands, 30 kb, 20 kb, 16 kb and 10.5 kb, were mapped 

within the MHC locus. The RFLP analysis indicated that 
there were at least four to six copies of the S-GAP 
sequences/genome depending on the strain of mouse tested. 
In addition genetic analysis of the RFLP patterns 
indicated that the 30 kb and 10.5 kb 6-GAP bands mapped 
to Q region between 01 and Q4 while the 20 kb and 16 kb 
S-GAP bands mapped to the T region. In addition, two of 
the E-GAP sequences that did not demonstrate RFLP 
polymorphism were mapped telomeric to the classical class 
25 1 genes. 



20 



Locating thft fl-OAP SBmi»n™« pirPrMv M -iaePnf «■„ 

MpncJasRicai n a « B T ffrnrfT 

The two of the S-GAP gene sequences that were mapped 
to the Q region between oi and Q4 , were directly linked 
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to Ql and Q2 by DNA sequence analysis of Ql and Q2 genes 
isolated from a C57BL/6 (H-2 b ) a library. Sequencing 
showed that both the Ql and Q2 genes are associated in a 
head to head configuration with an unknown gene 

5 (currently defined as the S-GAP gene) with both genes 

transcriptionally regulated by a single promoter/enhancer 
region having two promoters defined by a pair of CAAT and 
TATA boxes located about 25 bp apart on opposite strands 
of the DNA. Thus, having intact promoters and a common 

D regulatory region, the class I and 6-GAP genes would be 
transcribed from opposite strands, with the class I genes 
Ql or Q2 transcribed from 5 ' to 3 1 on the top strand and 
the S-GAP gene transcribed from 5 1 to 3 1 on the bottom 
strand. 



The sequence analysis of Ql and Q2 genes from 
C57BL/6, as well as a TL gene from A/J (H-2 a , Watts et al. 
EMBQ J. 8:1749-1759 (1980)) indicated that S-GAP promoter 
and regulatory regions had replaced the typical classical 
class I -type 5 1 regulatory sequences known to be involved 
in the regulation of classical class I genes. The S-GAP 
promoter is an active promoter since it is known that the 
Q2 gene expresses a gene product that can be detected in 
the intestine (Wang et al., Immunoaenet . , 38:370-372 
(1993)). These results indicate that the S-GAP promoter 
regulates the expression of some nonclassical class I 
genes . 
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Head to head gene structure with co-regulation 
of the genes has been previously described in organisms 
ranging from bacteria to humans, indicating that 
co-regulation is a widely adopted strategy. (Brickman et 
5 al., J. Molec. Biol . . 212:669-682 (1990); Xu and 

Doolittle, Proc. Natl. Acad, Sci. (USA) - 87:2097-2101 
(1990) ; Lennard and Fried, Molec. Cell . Biol . . 
11:1281-1294 (1991); Heikkila et al . , J. Biol. Chgm. . 
268:24677-24682 (1993); Fererjian and Kafatos, Dev. 

10 BiQl. , 161:37-47 (1994); Sun and Kitchingman, Nucleic 

Acids Res,/ 22:861-868 (1994)). In both prokary otic and 
eukaryotic systems, interaction between, or linkage in a 
metabolic pathway of two gene products has been suggested 
(Galvalas, et al., Mol . Cell. Biol. . 13:4784-4792 (1993); 

15 Lightfoot et al., Br. J Cancer . 69:264-2673 (1994)). It 
should be noted that in the S-GAP clones so far studied, 
the Ql and Q2 genes still possesses their own CAAT and 
TATA elements, and it is only the typical classical class 
I regulatory enhancer regions which are absent. 

20 Conservation of the fi-GAP Sequences Across Species 

To demonstrate that the S-GAP sequences are 
conserved, and that various species, including human, 
contain multiple copies of these genes a "Zoo blot" of 
various species of genomic DNAs was digested with EcoRI 
25 and analyzed by Southern blotting using the murine 6-GAP6 
BB500 probe. Under low stringency the blot showed 
detection of a multiplicity of bands in DNA from human, 
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rat, mouse, dog, rabbit and monkey. This indicates that 
multiple copies of the S-GAP sequences were found in many 
species including human. In addition, the conservation 
of the B-GAP multigene family predates speciation of 
5 murine and human and therefore is not the product of a 
recent gene duplication or rearrangement. The 
demonstration of interspecies sequence homology is 
significant because, in general, exons and regulatory 
regions tend to be conserved. Additional experiments 
10 show sequence homology for (5-GAP in rabbits, mice, 
chickens and humans. Thus, the pattern of specific 
regions of retained homology suggests that the S-GAP 
sequences are retained by selective pressure. 

Homojpcrv BffwrPTi Thr P-r,AP q^^ , anri ^ Drnnrrr rn 

15 for R-Tl^ft-jn 



Sequences within all six of the 500 bp S-GAP 
clones show striking sequence and positional homology to 
mouse, rabbit and human S-globin promoter regulatory 
elements. Important regulatory elements within a 106 bp 
20 region of the S-globin promoter have been characterized 
(Myers et al., Sc i ence 232:613-618 (1986); Stuve and 
Myers, MC- I, Oil BiQl 6:3350-3358 (1990)). Using 
saturation mutagenesis and 5' deletion promoters, Myers 
and his colleagues constructed a series of mutants that 
25 were used to identify four regulatory sequences. The 

four regulatory motifs were located between positions -95 
and -26 which contain a CACCC element (positions -95 to 
-87) , CCAAT and TATA box motifs at positions -79 to -72 
and -30 to -26, respectively, and a llbp repeat element 
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located between the CCAAT and TATA boxes (positions -53 
to -32) that contains 2 imperfect duplicated repetitive 
elements (SDRE) . The fact that these EDRE are essential 
for the expression of globin genes has been shown by 
5 deletional studies . 

Comparison of the six 500 bp fi-GAP sequences 
with the fi-globin promoter sequences from various species 
showed several striking sequence homologies to fi-globin 
regulatory elements. Analysis of the S-GAP sequence in 

10 this region revealed 5 regulatory motifs found in the 

fi-globin promoters. These include the 5* CACCC erythroid 
element between positions -127 to -123, CCAAT and TATA 
box motifs between positions -109 to -105 and -30 to -26, 
respectively, the cap consensus sequence positions -13 to 

15 -10, and a fifth and more complex regulatory element 

involved a fi-globin fiDRE of a 10 and 11 bp sequence (base 
pair numbering was determined from sequence alignments 
with gaps inserted and does not reflect the true base 
pair position from the transcriptional start site) . 



20 In all the S-GAP clones, two of the four fiDRE 

regulatory motifs were flanked by the CCAAT and TATA 
elements between positions -54 and -32, while two other 
SDREs were found immediately upstream of the TATA box 
(positions -11 to +1 and +3 to +12) . All of these fiDRE 

25 were conserved in sequence, and moreover, two of them 
were conserved in position (-54 and -32) . It is 
significant that the fiDREs conserved in S-GAP were 
conserved in globins from multiple species (mouse, 
rabbit, chicken and human) covering more than 100 million 
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years of evolution. This observation of evolutionary 
conservation indicates the E-GAP genes are old genes. 

A final putative regulatory motif from the 
S-GAP clones was AGATAA (nucleotides -82 to -77), which 
5 is identical to the DNA consensus sequence for the 

transcriptional binding factors NF-El . This family of 
DNA binding proteins (NF-Ela, b, and c) are involved in 
the erythroid and/or T-cell specific expression of many 
genes, including mouse and chicken adult S-globin, the 
10 heme pathway enzyme porphobilinogen (PGB) deaminase, the 
T-cell receptor and the leukemia virus HTLV III. 

A closer inspection of the regions of homology 
between the S-GAP and mouse B-globin promoters reveals 
several features: 1) is of 26 base pairs match at 
15 positions -35 to -io encompassing the consensus TATA 
motif (Bucher, J. MPl , Rip] 212:563-578 (1990)); 2) a 
region encompassing the S-GAP CCAAT box, positions -113 
to -109 contains the S-globin regulatory element CACCC 
which has been shown to be essential for the appropriate 
20 expression of S-globin in erythroid cells; 3) a perfect 
match of the CCAAT element exists at positions -109 to 
-105; 4) the fourth matching region encompasses a EDRE 
element, located between the CCAAT and TATA boxes at 
positions -64 to -45 (this region contains 16 of 19 bp 
25 matches with no gaps); and 5) a consensus cap site 

sequence as defined by Bucher (Bucher, supra. 1990) and a 
putative transcriptional start site is identified at 
nucleotides -13 to +1 . 
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Several other putative regulatory sequences are 
apparent in the S-GAP promoter. Between positions -68 
and -37 and beginning 5 nucleotides distal of the TATA 
element are 4 palindromes. The 5 base pair repeat TCAGA 
5 appears twice within 24 base pairs. These repeats flank 
and are found within a globin-like imperfect direct 
repeat element (positions -57 to -47) . Two longer 
palindromes with imperfect dyad symmetry of 12 bp, and 15 
bp, positions -67 to -56 and -51 to -37, respectively, 
( 10 contain smaller internal palindromes of 7 bp, CCTCAGG 
(-66 to -60) and 5 bp repeat, TCAGA (-46 to -42), 
respectively. This S-GAP 33 bp SDRE-like region 
combining the two large 12 and 15 bp imperfect 
palindromes, the E-globin imperfect direct repeat element 
15 and the two TCAGA palindromic repeats shows about 50% 
(16/33) nucleotide sequence homology to the mouse 
fi-globin promoter. 

Expression of Genes Immediately downstream from the g-GAP 
20 Sequences in the Gastrointestinal tract: 

The pattern of specific regions of retained 
homology between the S-globin regulatory motifs and S-GAP 
promoters suggests: 1) the sequences have diverged from a 
common ancestral gene; and 2) the preserved regions in 

25 the fi-GAP sequences play a critical role in the 

regulation of expression of their respective genes. 
Furthermore, the homology to promoters for genes 
intimately involved in iron metabolism, the occurrence of 
erythroid specific regulatory sequences, and the close 

30 proximity of these genes to the human locus responsible 
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for hemochromatosis, indicates a role for the S-GAP genes 
in iron metabolism. 



To demonstrate that the S-GAP promoters 
regulate downstream messages, it is imperative to show 
5 that the associated genes encode transcribable messages. 
Moreover, such messages should be expressed in tissues 
involved in iron absorption, i.e. the gastrointestinal 
tract, if they are to be involved in the pathogenesis of 
hemochromatosis . 



10 



Northern blotting was performed with poly A+ 
RNA from various organs including the gastrointestinal 
tract. The blot was developed using a probe derived from 
a S-GAP (Q2 b ) cosmid clone. Total cellular RNA was 
prepared by the TRlzol™ Reagent method according to the 
15 manufacturer's instruction (Gibco/BRL, Gaitherburg, MD) . 
poly A+ or mRNA was purified by oligo dT cellulose 
chromatography (Strategene, San Diego, CA) . RNA was 
analyzed on f ormaldehyde-agarose gels and transferred to 
Zeta Bind membranes as previously described (Evans, et 

20 al " PrOC Nat-1. AraH fyy , ^ d-,..^-^ ^ 1(7 (1984)) 

The cosmid clone containing a 15- GAP sequence that was 
used for the probe was obtained from a a library. The 
clone was digested with ApaLI and Kpnl to yield a lOkb 
fragment. The fragment was partially digested with BamHI 
25 to yield a 2 kb fragment encompassing the S-GAP sequences 
and an 8kb fragment further downstream containing the 
coding sequences for a S-GAP gene. 
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Northern blotting with showed the presence of 
polydisperse messages produced in tissues from stomach, 
duodenum, jejunum, kidney of 4-5 kb and 8 kb in size. 
The kidney showed less polydispersity with only the 6 kb 
5 band predominating. These results indicate that S-GAP 
promoter and upstream S-GAP coding sequences are 
expressed in the gastrointestinal tract and are 
associated with members of a multigene family of which 
the 5 Kb message of the jejunum is most prominent. The 
10 fact that this probe also recognized a band in the kidney 
and stomach, suggested that related members of a S-GAP 
family can be functioning in other tissues. 

The size and complexity of the StGAP mRNA 
products detected by northern blotting is consistent with 

15 fi-GAP genes coding for a family of large proteins. These 
characteristics are more like those of a mucin protein 
family rather than an ion transport family of molecules. 
The homology to fi-globin promoters, the occurrence of 
erythroid specific regulatory sequences and close 

20 proximity of nonclassical class I and fi-GAP genes to the 
locus responsible for hemochromatosis in humans, an 
inheritable disease of iron metabolism indicates a role 
for the fi-GAP genes and the nonclassical class I genes in 
iron metabolism. With this information in hand and the 

25 facts disclosed herein that S 2 M-knockout mice have an 
unusually high incidence of hepatomas led to the 
understanding that these mice have a metabolic and 
pathological condition similar to hemochromatosis. 
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I so l ation of murine B-gap mwAs f™ m ^ moilaf> -i^jn nfil 

library, 



A S-GAP cosmid probe is used to screen a mouse 
jejunal cDNA library (Strategene a ZAP Express kit) . 
5 Northern blots are used to indicate that the messages 
recognized by the S-GAP probe are expressed in the 
tissues examined. Clones which give positive signals are 
isolated, characterized and purified. DNA purified from 
the selected is digested with EcoRI and subjected to 
10 Southern blot analysis. The blot is probed with the 800 
bp E-GAP probe to verify that they contain the S-GAP 
genes . 

ClpniTKT of fhe human ft- Gan ftAnoe 

A human genomic DNA library produced in sCOS 
15 cosmid vector is prepared as described previously for 
producing a mouse genomic library in sCOS (Strategene, 
San Diego CA) . The isolation of the human S-GAP genes 
from the human sCOS cosmid library is performed by 
screening clones with a class I MHC probe. The probe is 
20 generated from exons 4 and 5 of the HLA-A2 gene, which 
encodes the highly conserved SjM binding domain and the 
transmembrane region. Selected clones containing class I 
sequences are detected, and the Cosmids from these clones 
are purified, cut with the restriction enzyme EcoRI, run 
25 on a 0.7% agarose gel and blotted onto a charged nylon 
membrane. The blot is hybridized with the class I 
probe, striped and rehybridized with the 800 bp S-GAP 
probe . 
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EXAMPLE II 

&M KNOCKOUT MICE DEVELOP IRON OVERLOAD SIMILAR TO 

HEMOCHROMATOS IS 

This example provides a method to analyze iron 
5 deficiency in an animal model where an MHC-encoded 
leczyme function has been genetically deleted. In 
addition, these mice are useful for evaluating the in 
vivo utility of carbohydrate ligands on the treatment of 
hemochromatosis and various iron related diseases such as 
10 atherosclerosis, arthritis or cancer. 

The data concerning iron overload in the i^M 
knockout mice is contained in Rothenberg and Voland, 
1994. Histologic examination of tissues from 12-18 month 
old knockout mice, contained on a standard diet, revealed 

15 evidence of hepatic necrosis. Iron stains of the tissues 
revealed iron deposition in the liver of all animals, and 
in the kidney, or the lung of approximately 10% of the 
animals. The standard diet contains 3 50mg/kg Ferric 
carbonate. When animals were placed on a "breeder diet", 

20 which contains in addition to ferric carbonate, lOmg/kg 
ferrous sulfate, iron stores rose dramatically. Iron 
deposition in the animals was also age related with the 
highest levels of iron seen in the oldest animals. 
Together these data indicate that the E 2 M- knockout mice 

25 develop iron overload that is diet and age related. In 
addition we have shown that the animals develop hepatomas 
and others have reported that older animals develop 
diabetes (Faustman et al . , Science 254: 1756-1761 
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<199D). This constellation of pathologies mirrors human 
hemochromatosis . 



Although the invention has been described with 
5 reference to the examples provided above, it should be 
understood that various modifications can be made without 
departing from the spirit of the invention. Accordingly, 
the invention is limited only by the claims. 
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I Claim: 

1. A composition, comprising a substantially 

purified carbohydrate ligand that specifically binds to a 
leczyme . 

5 2. The composition of claim 1, further comprising a 

polypeptide or lipid moiety bonded to said carbohydrate 
ligand. 

3. A composition, comprising a substantially 
purified carbohydrate ligand, wherein said carbohydrate 

10 ligand is an oligosaccharide* 

4. A composition, comprising a substantially 
purified carbohydrate ligand, wherein said. carbohydrate 
ligand is a polysaccharide. 

5. A method of identifying a carbohydrate ligand 
15 that binds to a leczyme, comprising the steps of: 

a. contacting a sample containing a 
carbohydrate ligand with a leczyme suspected to bind such 
a ligand; and 

b. detecting whether binding of the ligand 
20 and the leczyme has occurred. 

6. The method of claim 5, wherein said leczyme is 
an MHC-derived gene product . 
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7 ' The method of claim 6, wherein said MHC-derived 

gene product is a class I molecule. 

8- The method of claim 5, wherein said MHC-derived 

gene product is a class II molecule. 

5 9. The method of claim 5, wherein said MHC-derived 

gene product is an nonclassical class I molecule. 



10. 

a non-MHC derived gene product. 



The method of claim 5, wherein said leczyme is 



11. The method of claim 5, wherein said 

10 carbohydrate ligand is bonded to a polypeptide or lipid 
moiety. 



12. The method of claim 5, wherein said 

carbohydrate ligand or said leczyme contains a detectable 



label 



15 13 • Th e method of claim 5, wherein said 

carbohydrate ligand or said leczyme is associated with a 
cell . 
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14 . A method of identifying a leczyme that binds to 
a carbohydrate ligand, comprising the steps of: 

a. contacting a sample containing a leczyme 
with a carbohydrate ligand suspected to bind such a 

5 leczyme; and 

b. detecting whether binding of the leczyme 
and the ligand has occurred. 

15. The method of claim 14, wherein said leczyme is 
an MHC-derived gene product. 

10 16. The method of claim 14, wherein said leczyme is 

a non-MHC derived gene product . 

17. The method of claim 14, wherein said 

carbohydrate ligand is bonded to a polypeptide or lipid 
moiety. 

15 18. The method of claim 14, wherein said 

carbohydrate ligand or said leczyme contains a detectable 
label . 



19. The method of claim 14, wherein said 

carbohydrate ligand or said leczyme is associated with a 
20 cell. 
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20 • A method of purifying a carbohydrate ligand 

that specifically binds to a binder, comprising the steps 
of: 



a. contacting a sample containing a 

5 carbohydrate ligand with a binder capable of binding such 
a ligand to form a ligand-binder complex; 

b. separating the complex from the rest of 
the sample; and 



c. dissociating the ligand from the complex 
10 to obtain the purified ligand. 

21 * The method of claim 20, wherein said binder is 

a leczyme. 



22 • The method of claim 20, wherein said binder is 

an antibody. 

2 3. The method of claim 20, wherein said leczyme is 
an MHC-derived gene product. 

2 4 . The method of claim 20, wherein said leczyme is 
a non-MHC derived gene product. 



25. The method of claim 20, wherein said 

20 carbohydrate ligand is bonded to a polypeptide or lipid 
moiety. 
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26. A method of purifying a leczyme that 

specifically binds to a carbohydrate ligand, comprising 
the steps of : 

a. contacting a sample containing a leczyme 
5 with a carbohydrate ligand capable of binding such a 

leczyme to form a ligand- leczyme complex; 

b. separating the complex from the rest of 
the sample; and 

c. dissociating the leczyme from the complex 
10 to obtain the purified leczyme. 

27. The method of claim 26, wherein said leczyme is 
an MHC-derived gene product. 

28. The method of claim 26, wherein said leczyme is 
a non-MHC derived gene product. 



29. The method of claim 26, wherein said 

carbohydrate ligand is bonded to a polypeptide or lipid 
moiety. 
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30. A method of identifying a carbohydrate ligand 

that modifies the function of a leczyme-expressing cell, 
comprising the steps of: 



a. contacting a sample containing such a 

5 carbohydrate ligand with a leczyme-expressing cell; and 

b. subsequently assaying the cell to 
determine its function. 



31. 

an MHC-derived gene product 



The method of claim 30, wherein said leczyme is 



10 32. The method of claim 30, wherein said leczyme it 

a non-MHC derived gene product. 

33. The method of claim 30, wherein said 

carbohydrate ligand is bonded to a polypeptide or lipid 
moiety. 



15 34 * A method of identifying a leczyme that modif 

the function of a carbohydrate ligand- expressing cell, 
comprising the steps of: 



a. contacting a sample containing such a 
leczyme with a carbohydrate ligand-expressing cell; and 

20 

b. subsequently assaying the cell to 
determine its function. 



les 
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35. The method of claim 34, wherein said leczyme is 
an MHC-derived gene product. 

36. The method of claim 34, wherein said leczyme is 
a non-MHC derived gene product. 

5 37. A method of modifying the function of a 

leczyme -expressing cell, comprising contacting the cell 
with a carbohydrate ligand that binds the leczyme. 

38. The method of claim 37, wherein said leczyme is 

an MHC-derived gene product. 

10 39. The method of claim 37, wherein said leczyme is 

a non-MHC derived gene product. 

40. The method of claim 37, wherein said 

oligosaccharide ligand is bonded to a polypeptide or 
lipid moiety. 

15 41. A method of modifying the function of a 

carbohydrate ligand-expressing cell, comprising 
contacting the cell with a leczyme that binds the ligand. 

42. The method of claim 41, wherein said leczyme is 

an MHC-derived gene product. 



20 43. The method of claim 41, wherein said leczyme is 

a non-MHC derived gene product. 
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44. A method of identifying a peptide that binds to 

a carbohydrate ligand binding site of a leczyme, 
comprising the steps of : 



5 



a. contacting a leczyme and a carbohydrate 
ligand known to bind the leczyme with a test sample 
containing a peptide to be identified; 

b. determining the amount of oligosaccharide 
ligand bound to the leczyme after said reacting; and 

c. comparing the amount of oligosaccharide 
10 ligand bound in the test sample with the amount of 

oligosaccharide ligand bound in a control sample, wherein 
a decreased amount of oligosaccharide ligand, bound in the 
test sample relative to the amount of oligosaccharide 
ligand bound in the control sample indicates binding of 
15 the peptide to the leczyme. 

45. The method of claim 44, wherein said peptide 

and said leczyme are contacted prior to contacting said 
oligosaccharide ligand. 

20 46 • The method of claim 44, wherein said 

oligosaccharide ligand or said leczyme contains a 
detectable label. 

47. The method of claim 44, wherein said leczyme or 

said oligosaccharide ligand is associated with a cell. 
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48. The method of claim 44 , wherein said test 
sample contains a peptide library. 

49. A method of modifying a cell to produce a 
carbohydrate ligand, comprising introducing an expression 

5 vector encoding a leczyme into the cell to obtain 
expression of the leczyme, wherein said expression 
results in production of the oligosaccharide ligand by 
the cell. 

50. The method of claim 49, wherein said leczyme is 
10 an MHC-derived gene product. 

51. The method of claim 49, wherein said leczyme is 
a non-MHC derived gene product. 

52 . A method for modulating an immune response in 
an individual to an antigen, comprising administering the 

15 oligosaccharide ligand of claim l and the antigen. 

53. The method of claim 52, wherein said 
administering results in an increase in the immune 
response to the antigen. 

54 . The method of claim 52 , wherein said 

20 administering results in a decrease in the immune 
response to the antigen. 

55. The method of claim 52, wherein said 
administering further comprises an immune suppressing 
agent . 
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56. The method of claim 52, wherein said antigen 
and said oligosaccharide ligand are covalently bonded. 

57. The method of claim 52, wherein an adjuvant is 
administered along with said antigen and said 

5 oligosaccharide ligand. 

58. a method for treating a disease state involving 
a leczyme, comprising administering an effective amount 
of the oligosaccharide ligand of claim 1. 

59. The method of claim 58, wherein said disease 
10 state is an MHC-linked disease. 

60. The method of claim 58, wherein said MHC-linked 
disease is an autoimmune disease. 

61. The method of claim 58, wherein said MHC-linked 
disease is hemochromatosis. 

15 62 • The method of claim 58, wherein said disease 

state is an infection. 

63- The method of claim 58, wherein said disease 

state is transplantation rejection. 

64. A method for treating a disease state involving 

20 a leczyme, comprising administering an effective amount 
of a leczyme having a similar binding specificity for a 
carbohydrate ligand as the leczyme involved in the 
disease state. 
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65. The method of claim 64, wherein said leczyme is 
an MHC-derived gene product. 

66. The method of claim 64, wherein said leczyme is 
a non-MHC derived gene product. 

5 67. The method of claim 64, wherein said disease 

state is an MHC- linked disease. 

68. The method of claim 67, wherein said MHC-linked 
disease is an autoimmune disease. 

69. The method of claim 64, wherein said MHC-linked 
10 disease is hemochromatosis . 

70. The method of claim 64, wherein said disease 
state is an infection. 

71. The method of claim 64, wherein said disease 
state is transplantation rejection. 

15 72 . A method for detecting a genetic predisposition 

for hemochromatosis, comprising detecting a mutation in 
the heavy chain of a class I MHC molecule that reduces 
the ability of said heavy chain to associate with fi 2 
microglobulin . 



20 73. The method of claim 72, wherein said mutation 

eliminates a signal for the addition of a phosphate 
group . 



WO 96/17870 

PCT/US95/I5913 

85 

74. The method of claim 72, wherein said mutation 

eliminates the ability of a phosphate group in said heavy 
chain to be de-phosphorylated for diagnosing 
hemochromatosis resulting from a reduction in the ability 
5 of a heavy chain of an MHC class I molecule to bind \ 
microglobulin, comprising the steps of: 

a. isolating a class I MHC heavy chain from 
an individual to be tested; 

b. contacting the heavy chain with &, 

10 microglobulin under conditions suitable for associating a 
class I MHC heavy chain with S, microglobulin; and 

c. detecting the association of said heavy 
chain with said £ 2 microglobulin, wherein a reduced 
association of said heavy chain with E 2 microglobulin 

15 compared to the association of a control heavy chain with 
S, microglobulin is diagnostic for hemochromatosis. 

75. a protein that has the ability to distinguish 
self from non-self through the complimentary interaction 
of the protein and a carbohydrate and is not an 

20 immunoglobulin molecule or a T cell receptor molecule. 

76. a method for treating a disease in a patient 
characterized by abnormal absorption of a metal from the 
patient's gastrointestinal tract, said method comprising 
the administration of a therapeutic dose of a 

25 carbohydrate which specifically binds to a protein coded 
by a gene within the IgGSF. 
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77. The method of claim 76, wherein the metal is 
iron. 

78. A method for treating an autoimmune disease in 
a patient comprising administering a therapeutic dose of 

5 a carbohydrate decoy, said decoy specifically binding to 
a protein coded by a gene within the IgGSF, 
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